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Abstract 

In this paper, we use entropy functions to characterise the set of rate-capacity tuples achievable with either 
zero decoding error, or vanishing decoding error, for general network coding problems. We show that when sources 
are colocated, the outer bound obtained by Yeung, A First Course in Information Theory, Section 15.5 (2002) is 
tight and the sets of zero-error achievable and vanishing-error achievable rate-capacity tuples are the same. We also 
characterise the set of zero-error and vanishing-error achievable rate capacity tuples for network coding problems 
subject to linear encoding constraints, routing constraints (where some or all nodes can only perform routing) and 
secrecy constraints. Finally, we show that even for apparently simple networks, design of optimal codes may be 
difficult. In particular, we prove that for the incremental multicast problem and for the single-source secure network 
coding problem, characterisation of the achievable set is very hard and linear network codes may not be optimal. 

I. Introduction 

Determining network coding capacity regions (the set of link capacities and source rates admitting a network 
coding solution for a given multicast) is a fundamental problem in information theory and communications. Recently, 
the network coding capacity region for general network coding problems was implicitly determined using entropy 
functions 1 1 Q This characterisation has a similar (but slightly more complicated) form to the outer bound in J5] 
Section 15.5], which is expressed in terms of almost entropic functions. Explicit characterisation of the set of 
enttopy functions (or its closure) is however a very difficult open problem (for instance, it is known that this set is 
not polyhedral [3]). 

It is therefore natural to wonder whether there might be a simpler, explicit characterisation of network coding 
capacity regions which somehow avoid the use of entropy functions. However, these two problems are inextricably 
linked, and in general, determining network coding capacity regions is as hard as finding the set of all entropy 
functions, or equivalently, determining all information inequalities |4). 

One approach to avoid these intrinsic challenges of the general case is to seek special cases, or specific classes 
of networks for which an explicit, computable solution is possible. To date, only a few such special cases have 
been found. One notable example is where a single source data stream is unicast to multiple destinations. In this 
case, the capacity region is characterised by graph-theoretic maximal flow/minimum cut bounds, and linear codes 

'We assume the reader is familiar with polymatroids, entropy functions and representable functions. We review these and other related concepts 
in Section |n| and as needed throughout the paper. 
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are optimal |5|. As second example is a secure network coding problem when all links have equal capacities and 
the eavesdropper's capability is only limited by the total number of links it can wiretap. In this case, the minimum 
cut bound is also tight J6). 

Another approach is to develop computable bounds on the capacity region. Relaxation from entropy functions 
to polymatroids yields the so-called linear programming bound [2 Section 15.6]. Although this bound is explicit, 
both the number of variables and the number of constraints increase exponentially with the number of links in the 
network, making the bound computationally infeasible even for modest networks. Other works such as [7|, [8], [9] 
aim to obtain useful outer bounds with computationally efficient algorithms for their evaluation. In fact, it can be 
shown that all of the bounds obtained in those works are relaxations of the linear programming bound from [2 
Section 15.6]. 

This paper extends (H in several aspects. In both [1] and [2], vanishing decoder error probabilities are allowed. 



In Section III we will extend these results to the case where the decoding error probability must be exactly zero. 
We also prove that when all the sources are colocated, the outer bound Q is in fact zero-error achievable and 
tight. For the general non-colocated source case, we show that tightness of the outer bound reduces to a question 
of whether or not the addition of a zero-rate link can change the capacity region of a particular network that we 
derive from the original network. This leads us to conjecture tightness of the bound in general. 

The existing capacity result [1] does not place any constraints on the operation of intermediate nodes, allowing 
arbitrary network coding operations. We further extend this entropy-function based approach to three different 
practically-motivated cases where additional constraints are placed on the network: (a) all nodes use linear encoding, 
(b) all, or some nodes can only perform routing, and (c) we desire secrecy in the presence of an eavesdropper. 

In Section [IV] we consider the case where only linear network codes are allowed. We prove equivalence of zero- 
error and vanishing-error achievability, and characterise the linear network coding capacity region using representable 
functions. 

When comparing the performance of network coding to routing-only networks (where nodes can only store and 
forward received packets), it may be useful to have a capacity characterisation for routing in terms of entropy 
functions. In Section [V] we introduce almost atomic functions which provide just such a characterisation. We go 
on to consider heterogeneous networks, containing both network coding nodes and routing nodes, and show how 
to obtain an entropy function characterisation of the capacity region. 

In Section [VI] we impose secrecy constraints, where we assume the presence of eavesdropper who has access 
to certain links and desires to decode particular sets of sources. The objective is to design a transmission scheme 
such that the eavesdropper remains ignorant of the source messages. We will once again characterise the resulting 
general secure network coding capacity region via representable functions. 



Finally, in Section VII we will consider two very simple network coding problems where despite the apparent 
simplicity of the setup, characterisation of the capacity region turns out to be extremely difficult, and linear codes 
may be suboptimal. The first is incremental multicast, where the sources and sinks are ordered such that sink i 
demands sources 1,2, ...,£. The second example is secure unicast of a single source. This demonstrates that the 
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seemingly innocuous addition of a security constraint loosens the minimum cut bound [5|. Similarly we see that 
the min-cut result from [6| does not hold even for this simple case. 

Notation: K is the set of all real numbers and Ro is the set of all nonnegative real numbers. Random variables 
will be denoted by uppercase roman letters X and sets will be denoted using uppercase script X. The power set 
2 X is the set of all subsets of X. For a discrete random variable X taking values in the set (or alphabet) X, its 
support SP(X) is 

SP(X)={a; € X : Prpf = x) > 0}. 

Realisations of a random variable will typically denoted via lowercase x. 

For sets {Xi,X2, ■ ■ ■ ,X n } and S C {1,2, . . . , n}, the subscript notation Xg will mean {Xi,i E S}. Where 
it will cause no confusion, set notation braces will be omitted from singletons and union will be denoted by 
juxtaposition. Thus A U B U {i} can be written ABi and so on. Ordered tuples will be denoted 

(x(i),i = 1,2, . . . ,n) = (x 1 ,x 2 , ■ . -,x n ). 
II. Background 

In this section we provide the formal problem definition for transmission of information in networks consisting of 
error-free broadcast links. This includes representation of such networks as hypergraphs, the notions of a multicast 
connection requirement, network codes and zero-error or vanishing-error achievability. We then review existing 
results on characterisation of the network coding capacity via the use of entropy functions. In this section, we 
do not impose any additional constraints (such as linearity or security) beyond zero- or vanishing-decoding-error 
probability. 

A. Unconstrained Network Coding for Broadcast Networks 

We represent a communication network by a directed hypergraph G = (V,£ ). The set of nodes 

v = {v 1 ,...,v M } 

and the set of hyperedges 

£={E 1 ,...,E ]£l } 

respectively model the set of communication nodes and error-free broadcast links. In particular, each hyperedge 
e e £ is defined by a pair (tail(e), head(e)), where tail(e) € V is the transmit node and head(e) C V is the set 
of nodes which receive identical error-free transmissions from tail(e). When head(e) is a singleton, e models an 
ordinary point-to-point link. 

We assume that the network is free of directed cycles (a nonempty sequence of links {/i, . . . , /&} such that 
tail(/,) e head(/j_i) for i = 2, . . . , A; and tail(/i) G head(/ fc )). 
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Definition 1 (Connection Constraint): For a given communication network G, a connection constraint M is a 
tuple (S, O, D), where S indexes the sources, O : 5 ^ 2 V specifies the source locations and D : S ^ 2 V specifies 
the sink nodes. Unless specified otherwise, we let 

S = {Si, . . . ,S\ S \} 

be the index set of |<S| independent sources. Source s e S is available at every node in O(s) C V. Note that in 
general each source can be available to more than one network node. The sink nodes D(s) C V are nodes where 
source s should be reconstructed according to some desired error criteria. 

It is conceptually useful to imagine each source s as a message sent along an imaginary source edge, which for 
simplicity will also be labelled s. In this case, we can use the notation head(s) to denote O(s). For any e e £ and 
ii e V, we define 

in(e) = {/ e S U £ : tail(e) e head(/)} (1) 
in(u) = {/ e SU£ : u e head(/)}. (2) 

In other words, in(-) is the set of incoming edges (including the imaginary source edges). 

Solution of the network coding problem P = (G, M) requires a transmission scheme allowing source s to be 
reliably reconstructed at the sink nodes D(s). We have not yet specified the transmission capacity of each hyperedge, 
or the rate of each source. Characterisation of the capacity region for network coding means determination of the 
combinations of source rates and hyperedge capacities which admit a network coding solution. Before we can 
proceed, we need to formalise what we mean by network code, reliable and rate-capacity tuple. 

Definition 2 (Network Code): A network code 

$ 4 : e G £} (3) 
for the problem P = (G, M) is a set of local encoding functions 

<t>e- n yf^y- 

/ein(e) 

where y s is the alphabet of source s € S and 34 is the alphabet for messages transmitted on hyperedge e e £. 
Each network code induces a set of random variables 

{YfJeSuS}. (4) 

as follows: 

1) {Y s ,s G S} is a set of mutually independent random variables, each of which is uniformly distributed over 
its support and denotes a message generated by a source. 

2) For each e £ £, 

Y e = <j> e (Yf : / e in(e)) (5) 
and denotes the message transmitted on hyperedge e £ £. 
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If the set of random variables induced by a network code is given, then the local encoding functions ^ are 
determined with probability one. In other words, if 

Pr(YjF = W ,/ein(e))>0, 

then for all y e 7^ 4> e {Vf '■ f € in(e)), 

Pr (y e = j /e |y / = 2// ,/ein( e ))=o. 

For this reason, we will often specify a network code by its set of induced random variables. 

The following lemma follows directly from the above definitions and gives a necessary and sufficient condition 
under which a set of random variables is induced by a network code. 

Lemma 2.1: A set of random variables {Yf,f € SU£} defines a network code, with respect to a network coding 
problem P, if and only if 

1) Y s is uniformly distributed over its support for all s G S. 

2) H(Y s ,s£S) = Y, seS H{Y s ). 

3) H (r e I Y f : f G in(e)) = for all e G £. 

Conditions 2) and 3) are due to the mutual independence of the sources and the deterministic encoding constraints. 
Definition 3 (Rate-Capacity Tuples): For a network coding problem P let 

X (P)^4 S| xK{fl. 

be the set of all rate-capacity tuples 

{X,uj) = (A(s) : seS, oj(e) : e £ £) 

for P. 

Definition 4 (Fitness): A rate-capacity tuple (A,cj) G x(P) is fit f° r a network code {!/,/ G S U £} on P if 
there exists c > such that for all e G £ and s G S, 

A(«)<clog|SP(y,)|, (6) 

w(e)>clog|SP(Y e )|. (7) 

The tuple is asymptotically fit for a sequence of network codes {YJ l , f G 5 U £ } for n = 0, 1, . . . if there exists a 
sequence c„ > such that for all e G £ and s G 5, 

lim c„log|SP(r; l )| > A( S ), (8) 

lim c„log|SP(Y e ™)| < w(e). (9) 

n— >oo 

Note that fitness does not imply achievability of a rate-capacity tuple, rather that the tuple is not impossible. 
Fitness indicates that (up no normalisation) each individual source rate is not too large to be achieved by the 
corresponding source variable with the given alphabet size, and that each hyperedge capacity is large enough to 
carry the corresponding edge variable regardless of particular distribution. 
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Definition 5 (Zero-error Achievable Rate-Capacity Tuples): A rate capacity tuple 

(\,ui) = (A(s) : seS, oj(e) : e E £) 

is called zero-error achievable, or O-achievable if there exists a sequence of network codes n = 1, 2, . . . and 
corresponding induced random variables {YT : / G £ U S} such that 

1) (A, cj) is asymptotically fit for 

2) for any source s E S and receiver node u G D(s), the source message YJ 1 can be uniquely determined from 
the received messages ^Yj! 1 : / G in(tt)V In other words, 

H(Y s n | 17,/ e in(u)) = 0. (10) 

In Definition |5J each network code in the sequence has zero probability of decoding error. Relaxing this criteria 
to allow decoding error probability that vanishes in the limit, we have the following definition. 

Definition 6 (Vanishing Error Achievable): A rate capacity tuple (A, uS) is called vanishing error achievable, or 
e-achievable if the tuple is asymptotically fit, and 
2') for all s E S and u E D(s), there exists decoding functions g™ u such that 

lim Pr(Y s " ? glJXf : / G in(«))) = 0. 

In other words, decoding error probabilities vanish asymptotically. 
For any subset 7Z C x(P), define CL(1Z) as the subset of x(P) containing all tuples (A, w) such that there exists 
a sequence of (A™, a;™) G 1Z and positive numbers c n satisfying 

lim c n ui n (e) < w(e), (11) 

n— >oo 

lim c„A"(s) > A(s). (12) 

n— >oa 

Clearly, if every tuple in 1Z is O-achievable/e-achievable, then CL(7?.) is also O-achievable/e-achievable. 

The central theme of this paper is the characterisation of O-achievable and e-achievable regions for network 
coding via the use of entropy functions. 

Definition 7 (Entropy Function): A set of random variables {Yj-, i G Af} (where J\f is some index set) induces a 
real entropy function h : 2^ H> K such that for any a C A/", 

ft (a) =H(Y h i E a) 

is the joint Shannon entropj^jof (Yj : i E a), which according to our notational conventions we will also write 
H (Y a ). 
Let 

2 We define h (a) = whenever a is an empty set. 
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be the 2^1 -dimensional Euclidean space whose coordinates are indexed by subsets of Af. Thus, any element 
g € W[Af] has coordinates (g(a), a C TV). Elements of H[Af] are called rank function^ Clearly, entropy functions 
are rank functions. 

Definition 8 (Entropic Functions): A rank function h G H[Af] is 

• Entropic if h is the entropy function of a set of \Af\ random variables. The set of entropic functions is denoted 
F*(Af) C H[Af] [ 1 1 . When the index set Af for the set of random variables is understood, we simply denote 
the set of entropic functions as T*. 

• Weakly entropic if there exists c > such that c • h is entropic. 

• Almost entropic if there exists a sequence of weakly entropic functions h % such that 

lim h l = h. 

The set of almost entropic functions is f *. 
For any rank function g £ "H[Af], define the notations 

g(a\0)±g{aU/3)-g(p), (13) 
g(aA/3)±g(a)+g(p)-g(aUp). (14) 

If g is in fact an entropy function induced by random variables {Yi,i 6 Af}, then g(a \ 0) is the usual conditional 
entropy H (Y a | Yg) and g(a A/3) is the usual mutual information I(Y a ; Yp). We avoid the standard notation /(•; •) 
since it hides the underlying entropy function, which will be critical in most of what we do. 

The set T* plays an important role in information theory. Characterisation of this set amounts to characterising 
every possible information inequality. Thus T* essentially fixes the "laws" of information theory. However it turns 
out that r* has a very complex structure and an explicit characterisation is still missing [11 J. It has been proved 
that the closure f * is a closed convex cone ifTUl and hence is more analytically manageable than V* . For many 
applications, it is in fact sufficient to consider T*. However, it was proved in [12] that when \Af\ > 3, 

r* ^ f*. 

B. Existing Results 

For a given network coding problem P = (G, M), let T*(P) and H[P] respectively denote T*(SU£) and H[SU£}. 
Define the coordinate projection 

proj P : %[P] h> x(P) 

such that for any h € H[P], 

projp^Ks) = h(s), Vse5 (15) 
proj P [/i](e) = h{e), Ve € £ (16) 

3 This terminology comes from matroid theory and does not imply that such functions must be defined via ranks of linear operators - although 
such functions are rank functions by this definition. 



January 6, 2012 



DRAFT 



s 



Similarly, for any subset 1Z C H[P], 

proj P [7^{proj P [fc] :h€H}. (17) 

Again, if the underlying network coding problem P is understood implicitly, we will simply use the notations proj [h] 
and proj [JZ]. 

Consider any network coding problem P = (G, M). Define the following subsets of W[P]=H[S U £]: 

C l (P)±lh€H[P]:h(S)=Y,Hs)\, (18) 
I ses ) 

CT(P)={h€H[P}:h(s\m(e))=0,\/e€£ }, (19) 

, a ihen[P] : h(s | in(u)) = 0, 1 
C D (P)=< >. (20) 

I Vs6 5,«6i)(s) J 

The above subsets will be denoted by C\ , Cj and Cd respectively if the network coding problem P is understood 

implicitly. Consider a network code {Yj,, i G SUE} with induced entropy function h E H[SU£]. By Lemma 2.1 we 



see that h € C| since the sources are mutually independent, and h £ Cj due to deterministic transmission through 
the network. If the network code is zero-error, h e Co follows from the decodability constraint (jT0j». 
The set of e-achievable rate-capacity tuples can be characterised exactly as follows (TJ. 

Theorem 2.1 (Yan, Yeung and Zhang - e-achievable Region SJ]/): For a given network coding problem P = (G, M), 
a rate-capacity tuple (X,uj) € x(P) is e-achievable if and only if 

(A,w) e CL(proj p [con(r* nc, nc T ) nc D ]). (21) 

Inner and outer bounds for the 0-achievable region were also investigated in fOl using a similar framework as 
in HI, H. However, lfT3ll allowed the use of variable length coding, where the amount of data traffic on a particular 
link is measured as the average number of transmitted bits. 

In contrast, this paper studies the worst case scenario where the amount of traffic is measured by the maximum 
number of bits transmitted on a link (hence, it is sufficient to consider fixed-length codes). It is worth pointing out 
that when decoding error is not allowed, there is a significant difference between using the average or the maximum 
number of transmitted bits. For example, consider a source X compressed/encoded by an optimal uniquely-decodable 
code. The average length of resulting codeword is roughly equal to H(X). However, for all uniquely-decodable 
codes, the maximum length of the encoded codeword must be at least log|SP(X)|, which can be much greater 
than H(X) if X is heavily biased. 

The following outer bound follows directly from Theorem 2.1 but was proved earlier in 

Corollary 2.1 (Yeung - Outer Bound ^/): If a rate-capacity tuple (A, oS) is e-achievable, then 

{X,u) G CL(proj p [f* nCi nCj nC D \). (22) 
This bound |22) is not necessarily tight, since T* is not closed and convex in general. Therefore, 

con(r* n C\ n c T ) 
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theoretically may be a proper subset of 

con(f * n C\ n C T ) = f * n c, n C T 

where (a) follows from that T* is a closed and convex cone. 

It is clear that if (X,oj) € x(P) is O-achievable, then it is also e-achievable and hence must satisfy the outer 
bound ( p2] > in Corollary |2.1| In fact, it can be seen directly that ( |22] > must be an outer bound for the set of 0- 
achievable rate-capacity tuples. Suppose (X,lu) G x(P) is O-achievable. Then there exists a sequence of network 
codes {Yf, f € S U £} with induced entropy functions h n and positive constants c n such that for all e £ £ and 
seS 

lim c„ff(y™) < lim c„log|SP(y e ")| < w(e), 

n-4-oo n— J-oo 

lim c„ff(F s n ) = lim c„log|SP(Y s n )| > A(s), 
and that ft," <E C\ n C T n C D . Consequently, 

(A,w) e CL(proj p [r* nc, nc T nc D ]). 



The proof that ( |22| ) is an outer bound for e-achievable tuples is similar. However, as vanishing error is allowed, 
Fano's inequality is invoked to ensure lim n ^oo c n h n E Co- 
in the next section, we deliver our first main result, namely that (|22|> is tight when the sources are colocated. 



III. Tightness of Yeung's Outer Bound 



The analytical challenges in characterising V* (let alone its intersection with C\ and Cj) may render Theorem 2.1 



unattractive as a characterisation of the network coding capacity region. In this section we show that the more 



manageable bound of Corollary 2.1 which involves the closure of V* is in fact tight when the sources are colocated, 



a notion that we make precise below in Definition [9] Our proof will use quasi-uniform random variables, discussed 



in 



III-A which are a valuable tool in proving zero-error results. The proof of the main result, Theorem 3.1 is given 
in lllFBl 

Definition 9 (Colocated sources): Consider a network coding problem P = (G, M). Its sources are called colo- 
cated if 

0{s) = 0(s r ), Vs,s'eS. 

In other words, if a node has an access to any source s, it also has access to all the other sources. 

Theorem 3.1 (Colocated sources): Consider a network coding problem P = (G, M) with colocated sources 
according to Definition [9] Then 

1) A rate-capacity tuple (A, a;) is O-achievable if and only if it is e-achievable. 

2) The outer bound in Corollary |2.1| is tight. 



We fail to prove the tightness of the outer bound in Corollary 2.1 when sources are not colocated. However, we 



will give evidence in |III-C| to support our conjecture that the outer bound should be tight in general. 



January 6, 2012 



DRAFT 



10 



A. Tools: Quasi-Uniform Random Variables 



Before we prove Theorem |3.1| in Section |HI-B| we introduce key tools and intermediate results. In particular, the 
proof relies on the concept of quasi-uniform random variables, which are crucial for proving zero-error results. 

Definition 10 (Quasi-Uniform Random Variables [11 J): A set of random variables {Xi,i £ J\f} is called quasi- 
uniform if for any subset a C J\f, the random variable X a =(Xi,i £ a) is uniformly distributed over its support, 
or equivalently, 

H(X a )=log\SP(X a )\. 

Lemma 3.1: Suppose {^4, B} is quasi-uniform. Then one can construct a random variable W such that 

H(W) = H (A \ B) , 
H{A\B,W)= 0. 



Sketch of proof: As {A, B} is quasi-uniform, it can be proved from Definition 10 that for any b £ SP(B), 
Pt(A = a\B = b) = 



2 -H(A\B) if p r (A = a, B = b) > 
otherwise. 



Assume without loss of generality that 

{q(l,b),...,q(2^ A \ B \b)} 
is the set of all elements in SP(A) such that 

Pr(A = a | B = b) > 0. 
Let W be a random variable such that for any (a, b) in SP(A, B), 

1 if a = q(w, b) 



Pr(W = w\ A = a,B = b) = 



otherwise. 



The lemma can then be verified directly. ■ 
Definition 11 (Quasi-Uniform Rank Functions): A rank function h £ H[JV] is called 

• Quasi-uniform if h is the entropy function of a set of \J\f\ quasi-uniform random variables. 

• Weakly quasi-uniform if there exists c > such that c ■ h is quasi-uniform; 

• Almost quasi-uniform if there exists a sequence of weakly quasi-uniform rank functions h % such that 

lim h 1 = h. 

i— > oo 

Lemma 3.2: If hi,h 2 £ H[Af] are quasi-uniform, then their sum, defined for all A C M as hi(A) + h 2 {A), is 
also quasi-uniform. 

Proof: Suppose A^ and B^ are two independent sets of quasi-uniform random variables whose entropy 
functions are h\ and hi respectively. It is straightforward to construct a new set of variables Xjs/ with entropy 
function hi + hi, via 

X i = {A i ,B i ), Vi£N. 
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The lemma follows, since Xtf, and hence h\ + h 2 , is quasi-uniform. ■ 
For any weakly entropic function h, lfl4ll explicitly constructed a sequence of weakly quasi-uniform functions 

with limit h. It can be verified directly that this sequence of weakly quasi-uniform functions satisfies the same 

functional dependency constraints as h. Hence, we have the following proposition. 

Proposition 3.1: For any weakly entropic rank function h, there exists a sequence of quasi-uniform random 

variables {Uf, i £ Af} and positive numbers C£ such that 

1) For any a C Af, 

lim c e H (Ul) =h(a). (23) 

2) Uh(k\a) = 0, then 

H(U%\Ui)=Q, for all I. (24) 
In other words, h is the limit of a sequence of weakly quasi-uniform functions f e where 

h (k | a) = =>> | a) =0. 



In fact, Proposition 3.1 remains valid even if h is almost entropic. 

Proposition 3.2: For any almost entropic rank function h 6 T*(Af), there exists a sequence of quasi-uniform 
random variables {Uf,i € A/^} and positive numbers q such that ( p3~| and ( f24] > hold. 

Proof: By [14], there exists a sequence of quasi-uniform random variables {Uf,i € A/"} and positive numbers 
q such for all aCAC, 

lim Qi? (Ul) =h(a). (25) 

The challenge however is that ( |24] > may not hold if h is not weakly entropic (we only know that h is the limit of a 
sequence of weakly entropic functions). In the following, we will show how to modify the {Uf, i £ AT} such that 
( |24| i indeed holds. First, notice that {Uf,i £ Af} is quasi-uniform. Hence, for any k £ Af and a C Af, {Uj:, U„} 
is quasi-uniform. By Lemma 



3.1 



one can construct a random variable Wj: such that 



H{wi ta )=H{Ul\Ui), (26) 
H(UI\ULWL)=0. (27) 



k I u ai vy k.a 

Let 

^=K„.(M)eA} 

where 

A^{(fc, a) : h(k | a) = 0, k £ Af, a C A/"} 
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Then 



< lim c t H (W A ) 



<limc £ £ H ( W L) 



(k,a)eA 



}imc e H(Ui\Ui) 



(k,a)eA 



lim ce(H(Ui,U e a )-H(Ui)) 

* » /-von 



f — S-OO 

(fc,a)GA 



( = } ]T (h(k,a)-h(a)) 

(fc,a)SA 



= 

where (a) follows from ||2"5). Consequently, 



Urn QiJ (Wi) = 0. (28) 



We now construct our new set of random variables, {V t e ,i £ AT} by defining 

It is obvious that H(V£ \ V f a ) = for all (fc, a) £ A. Let f be the entropy function of Vfo. Then by (f25]), for any 

/3 c TV, 

h(p)=KmctH(Ug) (29) 
<Um Q^(C/j,Wi) (30) 
< Urn ctH (Uf) + lim c e H (w£) (31) 
( = } jfactH (Ujg) (32) 
= /i(/3). (33) 

where (b) is by ( |28] i. Consequently, 

lim c,/ £ (/3) = lim c,iJ {UL W e A ) = h{0), V/3 C M. 



to 



Since / is weakly entropic and / (fc | a) = for all (fc,a) € A, we can once again use Proposition 3.1 
construct a sequence of weakly quasi-uniform functions g J such that lirrij_ i . 00 — h and gi(k \ a) = for all 
(fc,a)eA. ■ 

B. Proof for Theorem \3.1\ 



The first claim of Theorem 3.1 is that the e-achievable and 0-achievable regions are equivalent when sources are 



colocated. For any network coding problem P = (G, M), it is clear that if (X,uj) £ x(P) is 0-achievable, then it is 
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also e-achievable and hence must satisfy the outer bound ( |22| > in Corollary |2.1| Thus, to prove Theorem 3. 1 it suffices 
to show that for colocated sources, the rate -capacity tuple proj p [/i] is O-achievable for all h G f * n C\ n Cj n Co- 

Our proof technique is similar to that used in JT]]. However, instead of constructing network codes from strongly 
typical sequences, we use quasi-uniform random variables. Codes constructed from typical sequences admit a small 
(but vanishing) error. However, as we shall see, codes constructed from quasi-uniform random variables can be 
carefully designed to ensure zero decoding error probability. 

Consider a network coding problem P = (G, M) where all sources are colocated. Suppose 

hef* nd nc D nc T . 



Since h is almost entropic, Proposition 3.2 implies the existence of a sequence of quasi-uniform random variables 

{UfJeSUS} (34) 

and positive numbers c n such that 

lim c n H (£/") =h(a), VaCSUS (35) 

n— foo 

H ( U e I Eft(e)) - 0, Vee£ (36) 



H 



( u s I ^in(u)) - °> y S eS,ueD( S ) (37) 



where (a) is due to h £ Cj and (6) is due to h € Co- Furthermore, by Lemma 3.2 the sum of two quasi-uniform 
rank functions is quasi-uniform. Hence, we can assume without loss of generality that 

lim Cn = (38) 



and Hill™) grows unbounded. This assumption ( |38f l will be used in the latter part when we construct a zero-error 
network code. 

In the following, for each n, we will construct a zero-error network code {Yj l ,f £ 5U£} from each set of 
quasi-uniform random variables {Uf, f € S U £} such that 

lim c n H (y™) = h(s) (39) 

n— >oo 

lim c n H (y™) < h(e) (40) 

and consequently, proj p [/i] is O-achievable and the outer bound is tight. 

Code construction: For simplicity of notation, we will drop the superscript n in ([34} and directly denote the set 
of quasi-uniform random variables by 

{U f JeSu£}. 

Suppose first that the U s ,s g S are mutually independent. The U s are quasi-uniform and hence uniformly 



distributed over their support. Thus ([36} holds and Lemma 2.1 implies that {Uf,f £ SU£} in fact defines a 
network code. Furthermore, by ( [37) , the decoding error probability is zero, and Theorem |3.1| is proved for this 
special case of independent sources. 
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Unfortunately, {U s , s G S} need not be mutually independent in general. To address this problem, we will modify 
these variables to satisfy the independency constraint. This is when we require all sources to be colocated. 
Since the network G is acyclic, repeated application of d36| can be used to prove that 



H(u f | u s ) = o, vfesue. 

Hence, for any e e £, there exists functions G e such that 

U e = G e {U s ). 

Similarly, for any e G £, s G S and v G D(s), there exists functions g e and g s <v such that for all u$ G SP (Us), 

G e (u s ) = 9e(G f (us)Je in(e)), Ve G £ (41) 
JZs = 9s,v(G f (us)J G in(u)), 

Vs G 5, u G D(s). (42) 

where (a) follows from <j36j and (b) from P7) . 

Definition 12 (Partition): Let ^4 = {1, 2, . . . , \A\] be an index set. A partition of a set X into |^4| partitions is 
a mapping 

E : A ^ 2^ 

where H(i) C X is the set of elements in partition i G A. If the 3(i) are disjoint then the partition 3 is called 
disjoint. 

Definition 13 (Regular Partition Set): Let £/ s ,s G 5 be random variables with supports SP(U S ). For s E S, 
define the index sets A S ={1, . . . , |As|}, where \A S \ < \SP(U S )\, and let 

be a disjoint partition of SP(Z7 S ) into |^4 S | subsets. We call the set of partitions {S s , s £ S} a regular partition set 
for {U s , s G S} if and only if for all 65 G As 

SP(U s )n l[S s (b s ) ^0. (43) 

Note that ( |43] l is a non-trivial condition, since in general SP(Us) ^ ris^^C^s) when the U s are not necessarily 
independent. 

We will now construct a zero-error network code {Yf, f E S U £} from a regular partition set. Let {5 S , seS} 
be a regular partition set according to Definition [13] By (|43]l, for each s G 5, there exists mappings 



T s : J]A^SP(C/ S ), (44) 



ies 
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such that for s € S and b$ £ YlieS 

(Ti(b s ),T 2 (bs),...,T\ s \{bs)) €SP(U S ) and 
(Ti(bs),T 2 (bs), . • • ,T lsl (b s )) e 5(&i) x S(6 2 ) x • • • x 3(6, S |). 
We can write this more concisely as 

(T s (b s ), s€S)e SP(U S ) n Ei(h). (45) 

By (g5j, 

r s (i s )es s (6 s ). (46) 

And as S s is a disjoint partition, b s can be uniquely determined from T s (bs). 

Now let {Y s , s £ S} be a set of mutually independent random variables such that for each s £ S, Y s is uniformly 
distributed over A s - Also, for each s £ S, define auxiliary random variables Z s such that 



By (|4"5|l and (05), it is easy to see that 



Further define 



It is now easy to see thaj^] 



Following from ( |5Tj ), we have that 



Z s = T S (Y S ). (47) 

SP(Z 5 ) C SP((7 5 ), (48) 

H(Y s \Z s )=0, s£S. (49) 

^G e (2 s ),eef. (50) 

SP(Z S£ ) C SP(Efe £ ). (51) 

H(Z k \Z a )=0 (52) 



whenever (ZTj. | J7 a ) = for some k £ S U £ and aCSUf. 
Let F e = Z e for all e € S . If in(e) H 5 = 0, then 

i ff (C/ e | u m[e) ) 

— -ff (^e I ^in(e)) 

( ^i/(y e |^ n(e) ) 

4 Recalling our convention to denote set union by juxtaposition. 
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where (i) follows from ( [36*1 ), (m) from ( "52]) and (Hi) from the fact that in(e) C E. Now, suppose in(e) n S ^ 0. As 
all sources are colocated in P, then in(e) n S = S. In this case, application of ( |47| i and ( |50] > yields 

o < //(y e | y n (e)) <fi"(n I >5) = H{Z e I y 5 ) = 0. 

Finally, for any seS and it G -D(s), 

= H (U s | C/ in(u) ) 



HfZ, I Z; 



in(«)J 



n(«)J 

where (a) follows from ( |49) . Hence {isf} defines a zero-error network code for P. Furthermore, it is easy to see 
that 



Hence, (77, £) is 0-achievable where 



iog|sp(y s )| = io g |AI 

log|SP(y e )| <log|SP(C/ e )| =H(U e ) 

7}(a) = log |^.|, VseS, 
C(e)=log|[/ e |, Vee£. 



(53) 
(54) 

(55) 
(56) 



The final ingredient of the proof is the following proposition which will be proved in Appendix [A] 
Proposition 3.3: Let {U\, . . . , Us} be a set of quasi-uniform random variables. If H (Ui) is sufficiently large, 
then there exists at least one regular partition set {E s , s G S} for {SP(U S ), se5} where 

By Proposition 3.3 (and (|3~8]l), we can construct a sequence of zero-error network codes such that 

jj (y; 1 ) = if (u? 1 t/f , . . . , uiu) -2\ og H (rjf 

Finally, from ( f35| l and that h € C\, we can prove that 

lim c n H(Y s n ) =h(s), 

lim c n H (77™) < /i(e) 
n— ^00 

for all s € S and e G £. Thus, proj p [/i] is 0-achievable and Theorem 3.1 follows. 



January 6, 2012 



DRAFT 



17 



C. Generalisation - non-colocated sources 

Assuming that all sources are colocated, we proved in the previous subsection that a rate-capacity tuple is e- 
achievable if and only if it is indeed O-achievable, and that the outer bound in Corollary |2.1| is indeed tight. We 
conjecture that the same outer bound remains tight even for sources are not colocated. In this subsection, we will 
give some arguments to justify our conjecture, which hinges on whether or not removing a zero-rate link can change 
the capacity of a certain modification of the original network. 

Let P = (G, M) where G = (V, £ ) and M = (S, 0,D). We make no assumption that the sources are colocated. 
Now consider two variations on the network coding problem P. 

Variation 1 - addition of a "super node": Let P 1 =(G^, M 1 ), where the underlying network G^ = (V',£') is 
obtained from G = (V,£) via inclusion of a "super node" v* and links f s ,s E S, 

V'=VUK}, (57) 

e* = e\j{f a ,aes}, (58) 

such that tail(/ s ) = v* and head(/ s ) = O(s). Here, the links f s are just like an imaginary source edge. 
The connection constraint M 1 is (<S, O x ,D) where for all s E S, 

1 (a) =0(s)U{v*}. 

In other words, source s is available not only at source node O(s) but also at the super node v*. 

Variation 2: Let P 2 =(G^, M 2 ), where the underlying network G^ = (y*,£*) is the same as in P 1 , but the 
connection constraint M 2 = (S,0 2 ,D 2 ) is modified as follows: 

2 {s) = {v*}, Vs E S (59) 
D 2 {s) = D(s)UO(s). (60) 

Hence, all sources are available only at the super node v* . In addition, source s is required to be reconstructed not 
only at the nodes in D(s) (the sinks in the original multicast problem P) but also at the original source nodes O(s) 
in P. 

Figure [I] illustrates the differences between the original problem P and its two variations P 1 and P 2 . In this 
figure, a sink node is denoted by an open square. Labels beside each sink node indicate the set of sources required 
for reconstruction. Sources are indicated by a double circle with an imaginary link (labeled with the source index) 
directed from it to the nodes where that source is available according to the connection constraint. 

Given a rate-capacity tuple (A,o>) E x(P)> l et ^[X, ui] be a rate capacity tuple in ^(P 1 ) (f° r me network coding 
problem P 1 ) such that 

T 1 [A,cj](s) = A(s), VseS, (61) 
T 1 [A,w](e) =w(e), Ve E £, (62) 
T x [A,o;](/ s ) = 0, VsG<S. (63) 
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(a) The original problem P. (b) The first variation P 1 . (c) The second variation P 2 . 

Figure 1. Variations of a multicast problem P 

In other words, the source rates and the hyperedge capacities remain the same for those elements existing in the 
original network, and the hyperedge capacities of the new links from the super-node v* to each of the original 
source nodes are all zero. 

Similarly, let T 2 [A, u] be respectively the rate capacity tuple in x(P 2 ) (f° r the network coding problem P 2 ) such 
that 

T 2 [X,uj}(s) = A(s), VseS, (64) 
T 2 [A,cj](e) =w(e), Ve e £, (65) 
T 2 [A,u;](/ s ) = A( S ), Vs e S. (66) 
Then, it is straightforward to prove the following: 

1) T 1 [A,o;] is e-achievable with respect to P 1 if and only if T 2 [A,cj] is e-achievable with respect to P 2 . 

2) If (A,w) is e-achievable with respect to P, then T 1 [A,w] and T 2 [A,w] are e-achievable with respect to P 1 
and P 2 respectively. 

Conjecture 1: A rate-capacity tuple (A, uj) is e-achievable with respect to a network coding P if T 1 [A,w] is 
e-achievable with respect to the modified problem P 1 . 

The main difference between P and P 1 are the zero-capacity links f s for s e S. At first sight it might be tempting 
to think that zero-capacity links cannot change the capacity region, and as a result that the conjecture is trivially 
true. However proving the conjecture is not straightforward. A zero-capacity link does not mean that absolutely 
nothing can be transmitted on the link. In fact according to the definitions, a finite amount of information (that 
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does not scale with ri) could be transmitted along the link. Thus the links f s can in fact be used (as long as their 
capacities vanish asymptotically) in any sequence of network codes achieving T 1 [A,w]. In fact there exists known 
examples where zero-capacity links can indeed modify the capacity of certain multi-terminal problems, in particular 
when there are correlated sources, or non-ergodic sources. 

If Conjecture [T] does not hold, it is equivalent to saying that a link with "vanishing capacity" can indeed change 
the set of achievable tuples, even when all sources are independent and ergodic. 

Theorem 3.2: Suppose Conjecture [T] holds. Then the outer bound in Corollary |2.1| is tight even when sources 
are not colocated. 

Proof: Let h G f*(P). Define a rank function 



such that for any /? C S U £ f 



where 



It is straightforward to prove that if 



g(0) = h(a 1 Ua 2 ) (67) 

ai = p\{f Sl seS} (68) 

a 2 = {seS: f s G P}. (69) 

h g f*(p) n C,(P) n c T (P) n c D (P), 



then 



g g r* (P 2 ) n Ci(P 2 ) n c T (P 2 ) n c D (P 2 ). 



By Theorem 3.1 proj p2 [<?] is e-achievable with respect to network coding problem P. As 

T 2 [proj p [/i]] =proj p2 [5] 

and is achievable with respect to P 2 , T 1 [proj p [/i]] is achievable with respect to P 1 . By Conjecture [T| proj p [/i] is 



achievable with respect to P. Consequently, the outer bound in Corollary 2.1 is tight. 



IV. Linear Network Codes 

In the previous section, the network codes were not subject to any constraints, other than those required by 
Definition [2] and that decoding error probabilities must be zero or vanishing, according to Definition [5]or Definition[6] 

For the remainder of the paper, we will consider various subclasses of network codes which result from imposing 
different kinds of additional constraints. 

To begin with, in this section we will study linear network codes [5|, which have relatively low encoding and 
decoding complexities, making them more attractive for practical implementation. 
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Definition 14 (Linear network codes): Let 

{Y f : f G £ U S} (70) 

be a network code (according to Definition [2jl for a problem P on a network G = (V, £), with local encoding 
functions 

$ 4 {0 e : e G £}. 

The code is called q-linear (or simply linear) if it satisfies the following conditions: 

1) For s G S, Y s is a random row vector such that each of its entries is selected independently and uniformly 
over GF(q). 

2) All the local encoding functions are linear. 

A network coding problem is said to be subject to a q-linearity constraint if only q-linear network codes are allowed. 

Let the row vector Y s have X s elements. Clearly, for a linear network code ( f70] i, one can construct a J^ies ^ ! x ^ s 
matrix G s for any s£5 such that 

Y s =YxG s . (71) 

where Y=[YiY 2 . . . Y\s\] is the length J^ies ^ low vector obtained from the concatenation of the Y, t and x is the 
usual vector-matrix multiplication. 

Similarly, as all local encoding functions are linear, for each e G £, there exists a Ylies ^< x We matr i x G e such 
that 

Y e =YxG e . (72) 

Hence, the symbol transmitted on link Y e is a length u e vector over GF(q). 

Following the nomenclature in [2J, the matrices G/, / G SU£ will be called the global encoding kernels. They 
define the linear relation between Y e (the message sent along edge e) and {Y s ,s G 5} (the symbols generated at 
the sources). It is easy to prove that 

|SP(r s )| =q A = (73) 
|SP(F e )| <Q Ue - (74) 

with equality holding in (|74j> when G e has full column rank. 

A sink node u G D(s) can uniquely decode a source Y s if and only if it can solve for Y s (but not necessarily 
all other Y i7 i G S \ s) from the following linear system with unknowns Y: 

Y e = Y x G e , Ve G in(tt). (75) 

It is clear that if F s cannot be uniquely determined from ( |75| , then there must be at least q solutions to ( |75| such 
that the values of Y s in each solution are all different. We can easily show that with maximum likelihood decoding, 
the decoding error probability must be at least 1 — 1/q. Consequently, for a network coding problem subject to 
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a g-linearity constraint, a rate-capacity tuple (A, a;) is O-achievable if and only if it is e-achievable. Therefore, we 
will always assume that linear codes are zero-error codes. 

As in Theorem |3.1| for general (possibly non-linear) network codes, a characterisation for the set of O-achievable 
rate-capacity tuples subject to a q-linearity constraint can be obtained by using entropy functions. However as we 
shall see, the entropy functions for linear network codes will be constrained to be representable lfl5l in the sense 
of matroid theory. 

Definition 15: A rank function h £ % [S U £] is called q- representable if there exists vector subspaces 

U i: i eSu£ 

over GF(q) such that for all a C S U £, 

h(a) = dim (Uj,i G a), (76) 

where dim (Uj, i £ a) denotes the dimension of the smallest vector space containing all of the U», i £ a. 

In an abstract sense, representable rank functions are similar to entropy functions, where the entropies of random 
variables are replaced by dimensions of vector spaces. It is well-known that representable functions are indeed 
entropy functions (and hence also polymatroidal) iflOll . Bill . Hence we will sometimes use the following conventions. 
For vector spaces U and V, we will use 

H(Y), H(V,Y), and H(V\Y) 

to respectively denote 

dimV, dim(U,V), and dim(U,V> - dimU, 
as if U and V were random variables. 



Similar to Definition 11 we can define weakly- and almost-representable functions. 
Definition 16 (Weakly/almost representable): A function h is called 

• weakly q-representable if c • h is q-representable for some c > 

• almost q-representable if it is the limit of a sequence of weakly q-representable functions. 

We use T*(S U £) and T*(S U £ ) to respectively denote the sets of q-representable and almost q-representable 
rank functions in H[S U £ ]. 

Theorem 4.1 (Achiev ability by linear codes): For any network coding problem P = (G, M) subject to a q- 
linearity constraint, a rate-capacity tuple (A,w) is O-achievable if and only if 

(A,«) g CL(proj P [T*(p)nei(p)ne T (P)ne D (p)]) . (77) 



Theorem 4.1 (for linear network codes) is a counterpart to Theorem 3.1 (for general network codes codes). 
However, unlike in Theorem |3.1| Theorem |4.1| holds even when the sources are not colocated. 

Before we proceed to prove Theorem |4.1| (which involves proving both an if -part and an only-if part), we will 
illustrate the main idea by proving the following special case of the if-part of Theorem |4.1| 
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Proposition 4.1: Consider a g-representable function h e T*(P) such that 

/ieCi(P)ne T (P)ne D (p) (78) 

and let (A,w) = proj p [/i]. Then (A,w) is O-achievable by g-linear network codes. 
Proof: By Definition [15] there exists a collection of subspaces 

h(a) = dim (Vi, i G a). 

MS) = 



over GF(q) such that 



As h e CAP), 



ses 



Therefore, 



dim (V s 7 seS)=J2 dim V s . (79) 



Similarly, as h G C T (P) n f *(P), we can prove that 

h(e,S) = h(S) 

for all e G £ . Consequently, 

V e c(V s ,se5), VeeS. (80) 

Let k = dim (V s , s G S). By (j79j and ( |80) , we may assume without loss of generality that all elements in V/ are 
length k column vectors for / G S U £. For each s G S, let M s be a A: x /i(s) full column rank matrix such that 
the space spanned by its columns is equal to V s . Similarly, for each e G £, let M e be a k x /i(e) full column rank 
matrix such that the space spanned by its columns is equal to V e . 

Let Z be a length k row vector such that each of its entries is independently and uniformly selected from GF(q). 
Then Z, together with the matrices {Mf , / G S U £} induces a set of random variables 

{y /: /G5u£} 

such that Y f = Z x M f for all / G S U £. Setting F = s G S], and similarly M = [M s , s G 5] we have 

y = z x m. 



By ( |79| l, it is easy to see that M must be a fc x k invertible matrix. Therefore 

Z = YxA-r 1 (81) 
and consequently, Yf = Y X Gf where G/ = M^ 1 x Mf. 
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In the following, we will prove that {Yf, / 6 SU £} is indeed a zero-error linear network code. To see this, first 
notice that Y s is a random row vector of length h(s) and all its entries are independently and uniformly distributed 
over GF(q). Now, consider any e <G £■ As h € Cj, 

/i(e, in(e)) = /i(in(e)). 

Therefore, 

dim <V e ,V / ,/ein(e)} = dim <¥/,/€ in(e)> 

or equivalently, 

V e C<V/,/Gin(e)). 
Consequently, we can construct a matrix ?/j e such that 

M e = [M/,/G in(e)] x 

Therefore, 

F e = Z X M e 

= 2x [M /( /ein(e)] X Ve 
= [ZxM / ,/Gin(e)] x^ e 

= [y / ,/Gin(e)]x^ e . (82) 



The equation ( |82] i clearly indicates that the local encoding functions are indeed linear. 
Similarly, as h € Cd(P), 

/i(s, in(u)) = /i(in(ti)) 
for any it g f(s). We can once again construct a "decoding function" ip au such that 

Y s = [Y f ,f £ in(w)] x V s ,«. 

Hence, the receiver node u can uniquely decode F s from {Yt,f € in(w)} and the probability of decoding failure 
is zero. As {Yf, f € 5 U 5} is a zero-error linear network code, together with (|73j)-(|74li, (A, w) is O-achievable by 
linear network codes. ■ 



Our next step is to prove that Proposition 4. 1 holds, even when the function h in (|76|> is almost g-representable. 



Before we prove this extension, we will need a few basic results from linear algebra. 

Lemma 4.1: Let A, B be vector subspaces. Then there exists a subspace CCB such that 

(A,C) = (A,B) 
AnC = {0}. 

Consequently, H (C) = H (B | A) = H (A,B) - H (A). 
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Using Lemma 4. 1 for any subspace A of B, one can easily construct a vector subspace A* of B such that 

(A,A*)=B, (83) 
AnA*={0}. (84) 

Any vector «gB can be uniquely expressed as the sum of vectors u% G A* and G A. For notational simplicity, 
we use Ta(u) to denote u\. Similarly, 

T A (W)±{T A (u) : u G V}. 

Clearly, if V is a subspace, then so is 7a(V). 

In general, T A (u) depends on the specific choice of A*. However, all the results mentioned in this paper involving 
T A remain valid for any legitimate choice of A*. The following lemma may be directly verified. 

Lemma 4.2 (Properties): For any subspaces A,Bi,B 2 , 

T A (B!)nA = {0} (85) 
(T A (Bi),A) = (A, Ei) (86) 
H(T A (B 1 ))=iJ(B 1 | A) (87) 
(r A (B 1 ),T A (B 2 ))=T A «Bi,B 2 )) (88) 

Furthermore, if Bi C B 2 , then 

r A (Bi) C T A (B 2 ). 
Using T A , we can "transform" a set of subspaces 

{B l5 . . . ,B„} 

into another set of subspaces 

{T A (B 1 ),...,T A (B„)} 

satisfying Lemmas |4.3| and |4.4| below, which are direct consequences of Lemma |4.2| 
Lemma 4.3 (Conditioning): 

H (T A (Bj),i G a)=H (T A (Bi, i G a)) - (B l;i G a | A) . (89) 

Lemma 4.4 (Preserving functional dependencies): If 

if (B ft |B i> *Ga) = 0, 

or equivalently, B^ C (R^i € a), then 

H{T A (M k ) | T A (Bj),i G a) = 0. 
Proposition 4.2: Consider any almost g-representable function h G %[<5> U £]. Let 

A={(fe,a) :fcG5U£,aC5U5 such that h(k \ a) = 0}. 
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Then there exists a sequence of weakly q-representable rank functions h e E H[S U £] such that 

lim h e = h (90) 

and 

h i (k\a)=0, (91) 

for all positive integers £, and (k,a) e A. 

Proof: By definition, for any almost g-representable function h, there exists a sequence {Upf E S U £}, 
£=1,2,... of collections of subspaces over GF(q) and q > such that for any a C 5 U £, 

lim QiJ (Uf_) = h (a) . (92) 



For every pair of (k, a) € A, by using Lemma 4.1 we can construct a subspace W k a such that 

# = H(Ut,Ui) - £T(U<) (93) 

Ui,C (ui,W £ fc! J. (94) 
Let = (W| Q , (jfe, a) e A). By (|92]>-([93]) and the fact that 

h (k, a) = h (a) , V(fc, a) 6 A, 

we have 

lim c,ff (Wi) < lim a #( W U 

(fc,a)6A 

* 4 l—¥00 

(fe,a)£A 

= J2 (h(k,a)-h(a)) 

(fe,a)eA 

= 0. (95) 

Define a new collection of subspaces 

V e f ±(Uf,Wi), VfeSUS (96) 
and let g l be the representable function induced by 

{Y e f ,feSl)£}. 

Obviously, for all (k, a) € A, Y e k is a subspace of V„, or equivalently, g (k, a) — g l {a). By ( |9~5| l, using a similar 
argument as given in d29|-([33| in the proof of Proposition |3.2| we can also prove that 



lim egg = h. 

The proposition then follows by letting h — ceg 1 . 
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In the proof for Theorem 3.1 an extra step (via the introduction of a regular partition set) is taken to construct a 
network code from a set of quasi-uniform random variables. This extra step requires that all sources are colocated. 
As we shall see, when we construct linear codes from a set of subspaces, the colocated assumption is no longer 
needed. 

Lemma 4.5: For any subspaces A, B, 

H (A | B) = H (A | A n B) . (97) 

Proof: Direct verification. ■ 
Lemma 4.6: Let {V/, / £ S U £} be a collection of subspaces. Then there exists a subspace A such that 

H(T A (Y S ), s e S) = J2 H(T A (V S )) 



ses 



and 



Proof: Let 



and A=(W s ,s€ S). Then 



H (A) = H (V s ) - H { y ° I ^s\s) 
ses 



W S =Y S n (Yj ,j E S\s),Vs e S (98) 



H{Y S | A) = H(Y S \Wi,ieS) 

>J2 H (^s | Wj,t6 S,Y k ,k^ s) 

seS 

( = } ]Ttf(V s | W s ,Y k ,k^s) 
ses 

(V s | Y k ,k^s) 

seS 

(c) 



>^ff(V s I Wi,ie5) 

sG5 

= J2h(y s \a) 



ses 

>H(Y S \ A) 



where (a) follows from that Wfe C Y k , (b) from that Wfc C (Y i , i ^ k), and (c) from Lemma 4.5 As all the above 
inequalities are in fact equalities, we have 



H(Y S | A) = Y,H(Y S I A) = ]T#(V S \Y k ,k^s) 



seS seS 
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Finally, as A C (Y s ,s e S), 



H (A) = H (Y s ,s eS)-H(Y s \ A) 

= H (Y s ,s eS)-^2H(W s \W kl k^ s) 



(99) 
(100) 



s£S 



We now have all the elements required to prove Theorem 4. 1 

Proof of Theorem \4.1\ : 
We begin with the if-part. Suppose 

he T*nCinc T nc D . 



Using Proposition 4.2 we can construct a sequence of q-representable functions / and q > such that 



lim cif = h 

l—^oo 

and each f e satisfies all of the same functional dependencies as h, i.e. 

h(k\a)=0 => f(k \a) = 0. 

In particular, 



r ee T nC c 



For each t, by definition, there exists subspaces 



{Ui,i e SUE} 



over GF(q) such that f{a) = H (Uj,i € a) 



Then by Lemma 4.6 there exists a subspace A such that 

H(T At (V e s ),se S) = J2H(T A e(V e s )) 

and 

H (A')^H (Vises)- Y.H^IWQ 

Let g e be the representable function induced by the subspaces 

{2V(Uf),ie<SU£}. 



(101) 



(102) 



Then by ( fTOT) , g e e C\. As each /'eC T n C D , by Lemma |44| / eC T fl C D . By Proposition |4T| projp^] is 



O-achievable. 
Due to ( fT02l ) 

and hence, 



lim q# (A £ ) = 0, 



lim cig = lim q/ = h. 
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Thus, proj p [/i] is also O-achievable and the if-part of Theorem 4.1 is proved. 

Now, we will prove the only-if part. Suppose (A, w) G T(P) is O-achievable subject to a q-linearity constraint. 
Then there exists a sequence of zero-error linear network codes {YJ 1 , f E S U £} and positive constants c n such 
that 

lim CnH(Y e n ) < lim c„log |SP(Y™)| < w(e), Ve G 5, 

n—too n— >oo 

lim c n H(Y s n ) = lim c„log |SP(Y™)| > A(e), Vs G 5. 

Again, each set of random variables {YJ 1 , f 6 SU£ } induces a g-representable function h n such that H \ Yp, f E aj - 
h n (a) for all a C 5 U £. Since {YJ 1 , / G 5 U £} is a zero-error linear code, we have h n G T* n C| n C T n C D . 
Therefore, 

(A,w) g CL(T;nCinc T nc D ) 

and the theorem is proved. ■ 

V. Routing 

Another class of network coding constraints that is of great practical importance is routing, which requires that 



network nodes perform only store-and-forward operations. We will consider two main cases. In Section V-A 



we 



consider networks where all nodes must perform routing. In Section V-B we consider heterogeneous networks 
consisting of both routing and network coding nodes. 



A. Routing-only schemes 

We first consider networks where the nodes are only able to perform routing. We will formalise what we mean 
by "routing" later, and proposed a generalisation. In such routing-based schemes, information is transmitted from 
the sources to the destinations via a collection of "routing subnetworks". 

Definition 17 (Routing subnetworks): For any given network coding problem P = (G, M), a routing subnetwork 
is a subset T of S U £ such that 

1) |Tn S\ = 1. Thus, T is associated with a source and we denote that unique source in Tn S by v{T). 

2) For any link e G Tn £, in(e) n T ^ 0. In other words, either there exists another link / G T such that 

/ G in(e), 

or v(T) G in(e), i.e., the originating node of link e is a source node of v(J~). Hence, the subnetwork formed 

by the set of links in T is in fact "connected" and is "rooted" at v{T). 
A routing subnetwork is in fact a simple generalisation of the usual multicast trees used in networks with point- 
to-point links (i.e. the underlying network is a directed graph) for constructing a routing solution (where messages 
are being forwarded and relayed at intermediate nodes without coding). While it is sufficient to consider multicast 
trees in such networks, the concept of multicast trees does not extend naturally to wireless networks (where the 
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underlying network is a directed hypergraph). In particular, in our hypergraph model, links £ are broadcast, i.e., 
the message sent over a link e can be received by more than one node. Therefore, it is not reasonable (and also 
not necessary) to insist that there is a unique path connecting a source to a sink. According to our definition of 
a multicast constraint, sources may also be available at more than one node. Therefore, the condition s € in(e) 
means that there exists a node u G O(s) such that u = tail(e). 
Definition 18 (Achievability): A rate-capacity tuple 

(A,w)ex(P) 

is called O-achievable subject to a routing constraint if there exists a collection of routing subnetworks 71 and 
subnetwork capacities Ci > such that 
(Rl) For any edge e e £, 

w(e) > ]T Ci. (103) 

i:eeTi 

(R2) For any i and u e D(v(Ti)), there exists eeTi such that u e head(e). In other words, the node u is on 

the routing subnetwork. 
(R3) For any source seS, 

A(«)= X! c - (104) 

i.u(Ti) = s 

Clearly, these three conditions are not chosen arbitrarily but have a meaning in practice. Suppose (A, to) is ID- 
achievable subject to a routing constraint. This tuple corresponds to a zero-error routing solution defined as follows: 
Assume without loss of generality that A(s),w(e) and a are all positive integers. For each se5, let the source 
message Y s be a g-ary row vector of length A(s). For each i, one can use the routing subnetwork % to transmit Cj 
g-ary symbols of Y s from the source nodes (which have access to the source Y s ) to all sink nodes u e D(s). By 
(R2), it is guaranteed that all sink nodes receive all A(s) g-ary symbols of Y s and hence can decode Y s . Furthermore, 
a link e e £ is used in the routing subnetwork % if e e 77. Therefore, 

is the total number of g-ary symbols that have been transmitted on link e. Clearly, the rate-capacity tuple (A, to) is 
fit for this routing based solution. 

In this routing solution, a source node does not perform any coding, except for partitioning a source message 
into several independent segments, and forwarding each segment via a routing subnetwork to the corresponding 
sink nodes. This corresponds to the usual concept of routing in networks consisting of point-to-point links. For 
successful decoding, a sink node must receive every segment of the source message from the required sources. 

In the following, we consider a slight generalisation of the concept of routing, where source nodes can encode 
source messages into correlated segments (corresponding to intra-session coding). By doing so, we can weaken the 
conditions (R2) and (R3). 
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Definition 19 (Generalised routing constraint): A rate-capacity tuple (A, cj) is called 0-achievable subject to a 
generalised routing constraint if there exists a collection of routing subnetworks 71 and subnetwork capacities 
Ci > satisfying (Rl) and the following condition: 

(R2') for any source s £ S and any sink node u G D(s), 

X(s)< °i- (l05 ^ 

i:in(u)nT;#0 and i/(T;) = s 

Again, each 0-achievable tuple (A, u) subject to a generalised routing constraint is fit for a zero-error routing scheme 
as follows: Let Y s be a g-ary row vector of length A(s). Instead of partitioning a source message Y s into independent 
pieces, one can encode (e.g. using simple codes for erasure channels) Y s into 2~2i- v (Ti)=s Ci 9~ arv symbols such that 
any A(s) encoded symbols can reconstruct Y s with no error. These 2~2i-v(%)=s c * symbols will be forwarded via 
the routing subnetworks % (where v(%) = s) to sink nodes in D(s). As before, all intermediate network nodes 
perform only store-and-forward operations. The condition (R2') then guarantees that each sink node u g D(s) 
receives at least A(s) coded symbols of Y s . Hence, the node can decode Y s without error. 

In the following, we will characterise the set of 0-achievable tuples subject to a (generalised) routing constraint 
using a similar framework as developed in Sections [III] and [IV] Developing a characterisation within this same 
framework provides a convenient way to evaluate how a routing constraint may reduce the set of 0-achievable 
tuples. We should point out that we are not the first to characterise 0-achievable rate-capacity tuples subject to 
(generalised) routing constraints. In fact, if 

|head(e)| = 1, Ve € £ , 

then the characterisation of 0-achievable rate-capacity tuples subject to (generalised) routing constraint can be 
obtained by solving variations of the fractional Steiner tree packing problem fl6l . Our characterisation is however 
unified with the entropy function formulation used for network coding and highlights the differences (and similarities) 
between different characterisations with or without (generalised) routing constraints. 

So far we have seen that entropy functions and representable entropy functions were the key ingredients in 
characterising the capacity regions for general network codes and for linear codes. For networks with routing 



constraints, we introduce almost atomic functions which in Theorem 5.1 below will provide the corresponding 
characterisation of the set of 0-achievable tuples. 

Definition 20 (Atomic rank function): A rank function h E W[S U £} is called atomic if there exists TC5Uf 
such that 

|i if/3nr^0 

HP) = < (106) 
otherwise. 



It is called almost atomic if it can be written 



where for all i, c, > and h 1 is atomic. 
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Let Taa(P), or simply Taa, be the set of all almost atomic rank functions in %{S U £]. It can be easily proved 
that Taa is a closed and convex cone contained in T*. Thus, almost atomic rank functions are entropic. 

Proposition 5.1: Let h be an atomic function such that there exists nonempty subset T C SU£ and ( |106| > holds. 
Then T is a routing subnetwork (for network coding problem P) if and only if 

fteC T (P)nCi(P). 

Proof: We first prove the only-if part. Let T be a routing subnetwork. By definition, |Tn <S| = 1. It can be 
verified directly from definition that 

ms) = x>oo- 

Hence, /i e C|(P). It remains to prove that h £ Cj(P). 

For any e £ £, if e T, then /i(in(e), e) = /i(in(e)) by ( |106| l. On the other hand, suppose e G T. Again as T 
is a routing subnetwork, in(e) n T is nonempty. Thus, 

ft,(in(e),e) — 1 — h(\n(e)). 

Hence, h(\n(e),e) = /i(in(e)) for all e £ £. Consequently, h £ Cj. The only-if part follows. 
Now, we will prove the //-part. Suppose e £ Tfl £ . Then h(e) = 1. As h € Cj, 

/i(e,in(e)) = /i(in(e)) (107) 

and consequently /i(in(e)) = 1. By ( |106| l, 

Tnin(e)^0 (108) 

and condition 2) of Definition [17] is satisfied. 

As h £ C\, it can verified directly that |T fl 5| < 1. Since the network G is acyclic and there are only finite 
number of links, there must exist at least one s £ S such that s £ T. Consequently, \Tfl S\ = 1. Hence condition 



1) of Definition 17 is satisfied and the proposition follows. ■ 
Theorem 5.1 (Routing capacity): A rate-capacity tuple (A,w) is O-achievable subject to a routing constraint if 
and only if 

(A,«) £ CL( P rojp[rAAnc T nc D nC|]). 

Proof: We will first prove the only-if part. Suppose (A,w) is O-achievable subject to a routing constraint. 



By Definition 18 there exists a collection of routing subnetworks % and nonnegative real numbers c, such that 
conditions (Rl) - (R3) hold. 



By Proposition 5.1 each % is associated with an atomic rank function h l £ Cj n C\ n Taa such that 



l if p n % f 

h % (J3) = { (109) 
otherwise. 
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For all sink nodes u 6 D(v(%)), (R2) implies that /i l (in(u)) = 1. Hence, 1 = h l (\n(u)) = h % (\n(u), v{Ti))- On 
the other hand, if s ^ v{Ti), then s % and /i l (in(u)) = h l (\n(u), v(%))- Consequently, h % £ Co- 
Let h = J2i Cih % . Since 

h l £ r AA n c T n c D n Ci 

for all i, h is also in Taa fl Ct fl Cd fl C|. Finally, (R2) and (R3) imply that for any s £ S 

A(s) = c * ( n °) 

i:v(Ti)=s 

= J2*w(s) (in) 

i 

= h{s). (112) 

Similarly, (Rl) implies w(e) > h(e) for all e £ £. Thus, (A, a;) e CL(proj p [/i]) and the only-if part follows. 

Now, we will prove the if-part. It is easy to prove that if (A, to) is O-achievable subject to a routing constraint, 
then all tuples in CL(A,w) are also O-achievable. Therefore, it is sufficient to prove that proj p [/i] is O-achievable 
subject to a routing constraint for all 

he r AA nc T nc D nC\. 

Since h is almost atomic, there exist atomic functions 

fl if 13 n 7-^0 
h t (fi) = \ (H3) 
I otherwise. 

such that h = J2i c i^ 1 - 

As each h l is entropic (and hence polymatroidal) and c, is nonnegative for all i, 

c t h l £ C T n C D n C\ 

i 

implies that 

h l eC T nC D nC[, Vi. 



By Proposition 5.1 each % is in fact a routing subnetwork. Also, h % £ Co implies that for any u £ D(v(%)), 

h(\n(u)) = h(\n(u),v(Ti)) = 1. 

This implies in(u) n % 7^ and hence (R2) is satisfied. 
For any s £ S, and u £ D(s), 

h(s) = J2c i h i (s) (114) 

i 

i:v(Ti)—s 

where (i) follows from the fact that h l (s) = if v(T) 7^ s. Hence (R3) is satisfied. Condition (Rl) can also be 
proved directly. The if-part is then proved. ■ 
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Using a similar approach as in Theorem |5.1| we can also characterise the set of O-achievable rate-capacity tuples 
subject to the generalised routing constraint. 

Theorem 5.2 (Generalised routing capacity): Consider a network coding problem P. A rate-capacity tuple (A, u>) 
is O-achievable subject to the generalised routing constraint of Definition [19] if and only if 

(x,oj) g CL(proj* P [r AA nc T nCi]). 

where 

proj P [/i](s)= min h(s A in(u)) (116) 

pro$[h](e)±h(e). (117) 
Proof: Starting with the only-if part, suppose (A, u) is O-achievable subject to the generalised routing constraint. 



By Definition 19 there exists a collection of routing subnetworks %, and nonnegative constants Cj such that 
conditions (Rl) and (R2') hold. Each T% is associated with a rank function h l G Taa H Cj n C\ defined as in ( [109) 1. 
Let 

h = c %h % - 



Again, (Rl) implies that 



By (R2'), for any s G S and u G -D(s) 



w(e) > /i(e), Ve G £. 



A(s)< ^ c, (118) 

i:in(«)n7j^0 and sG7; 

( = } 5^ Ci^s A in(u)) (119) 



where (a) follows from that 

ft'(aAin(u)) 

As (|120[) holds for all m G D(s), we have 



M>Ain(u)) (120) 

1 if in(u) n 71 7^ and s ET% 
otherwise. 

A(s)<proj;[fc]00. (121) 

Thus, (A, to) G CL(proj P [/i]) and the only-if part follows. 

Now, we will prove the //'-part. Let h G Taa H Cj H C| and (A, w) G CL(proj P [/i]). As before, we can construct a 
collection of functions h l , routing subnetworks % and positive constants c,; such that h = 2~2i Cih 1 and ( |1 1 3| > holds. 
By definition, 

w(e) > /i(e), Ve G f . 
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and for any s 6 S and u € -D(s), 



A(s) <proft[/i]( a ) 



(122) 



< /i(s A in(u)) 



(123) 



E 



(124) 



i:in(u)n7i#0 and sGT; 

Then both (Rl) and (R2') are satisfied and the result follows. 



B. Heterogeneous networks: Partial routing constraints 

In the previous section, we considered two varieties of routing schemes defined by routing subnetworks. In 
those schemes, each subnetwork is dedicated to sending a segment of data from a source to its respective sinks. 
Intermediate network nodes can only perform store-and-forward operations to forward the same data segment across 
a subnetwork. As only store-and-forward operations are performed, the computational requirements for intermediate 
nodes are relatively low. Despite this advantage, such routing-based schemes may suffer loss in throughput, as 
evidenced by the now famous example of the butterfly network ifTTIl . In some cases, this loss can be significant. 

In this section, we will consider more advanced schemes where some subsets of the intermediate nodes have 
sufficient computational resources to permit more sophisticated data processing in order to increase the throughput. 
Thus the network now consists of two types of nodes: routing nodes and network coding nodes. As demonstrated 
by the butterfly network example, there are known instances where the maximum possible throughput can in fact 
be achieved with only one network coding node, and all other nodes performing routing. 

The aim of this section is to extend out methodology to such heterogeneous networks. As a first step, we need 
to clarify the concept of store-and-forward. Figure [2] is a subnetwork of G such that the node visa "routing node" 
(where only store-and-forward operation is allowed). The node has two incoming links and one outgoing link. 
Suppose that v receives (bo,b\) from the incoming link e\ and (62,^3) from link e%. A natural question is: If v 
can only perform store-and-forward operations, which types of outgoing message it can send? 



Figure 2. Partial routing contraint. 

Naturally, we should allow the routing node v to send hi for any i = 0,1,2,3, but not 60 © b-3. The question 
however is if v can send bo b\ which is a function of the incoming message from ei ? 
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In this paper, we will assume that v, as a routing node, is in fact permitted to perform intra-edge coding and 
send 60 © b\. We do not allow inter-edge coding across different incoming links. Using this slightly generalised 
definition of routing, we can once again use the tools developed earlier to characterise O-achievable/e-achievable 
rate-capacity tuples for heterogeneous networks with "partial routing" constraints. 

Definition 21 (Routing nodes): With respect to a network code {Yt, f £ S U £}, an intermediate node v is said 
to be a routing node if for all outgoing links e of v (i.e., tail(e) = v), there exist auxiliary random variables 

PWein(e)} 

such that 

H (Y e I Y U , f G in(e)) = H (Y f>e , f G in(e) \Y e ) = (125) 
H (Y/, e I Yf) = 0, V/ G in(e). (126) 

In other words, the outgoing message Y e is formed by a set {Yf >e ,f € in(e)} such that each element Yy,e is a 
function of the incoming message Yf from the link /. Routing links are defined as outgoing links from a routing 
node. 

Let gCfbe the set of all routing links. In other words, e G Q if and only if tail(e) is a routing node. We refer 
to q as a partial routing constraint. 

Definition 22 (Network code with partial routing constraints): A network code satisfying the partial routing con- 
straint q is a set of random variables 

{Y, i G S U £} U {Y jie , e G g,j G in(e)}. 

such that {3^, i G 5Uf} is an ordinary network code according to Definition [2] and in addition, ( |125| l and ( |126[ ) 
hold for all e E g. We refer to such a code as a g-network code 

To go along with our definition of a network code with partial routing constraints, we need to update our definition 
of fitness. 

Definition 23 (Fitness of a network code with partial routing): A rate-capacity tuple (A, oS) is fit for a p-network 
code {Yi, i G S U £} U {Y he . e 6 ft j e in(e)} if 

A(a)<log|SP(Y,)|,VseS (127) 
w(e)> ^ log|SP(y / , e )|,Vee g (128) 

/ein(e) 

w(e) >log|SP(y e )|,Ve^ e . (129) 

Note that we use ( |128| l rather than (|6| to highlight that the outgoing message Y e will not be jointly compressed by 
the routing node. The set of 0-achievable and e-achievable rate-capacity tuples subject to a partial routing constraint 
q can be defined similar to Definitions [3] and [6] 

Our approach for characterisation of the set of 0-achievable or e-achievable rate-capacity tuples for g-codes is to 
transform the problem with partial routing constraints into an equivalent unconstrained problem (G^, M). 
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Given a network coding problem (G, M) with partial routing constraint g, define G^=(V ,£ ) as follows 

1) Add new nodes: 

V' =VU{Vy, e] ,ee&jein(e)}. 

2) Add new links: 

£' = £ U {[j,e],e G g,j G in(e)} 

such that 

head([j,e]) = tail(e) (130) 
tail(L?,e]) = V b>] . (131) 

3) Modifying existing link connections: For all f £ £, the set head(/) is modified as 

(head(J) \ {tail(e) : e G g, f G in(e)}) U {V [M : e G g, f G in(e)}. 

In other words, if a link / was directed to a routing node tail(e) for some e G Q, it will be redirected to the 
newly created node Vj/ ie j. 

Figure [3] is an example illustrating how to modify a network to remove the partial routing constraint. In this 
example, Figure |3ja) is one part of the network where e G g is a routing link. Figure |3jb) shows how that part of 
the network is transformed. In the new network, we no longer impose any routing constraint. 



fi h 

\ / 

%i,e]q 9 V [h,e] 

[/2,e]\ >[fi,e] 



(a) Original network. 



Figure 3. Removing a routing contraint. 



(b) Modified network. 



Theorem 5.3 (Network transformation): A rate-capacity tuple (A, to) is O-achievable/e-achievable with respect to 
a network coding problem (G, M) subject to a partial routing constraint g if and only if there exists a O-achievable/e- 
achievable rate-capacity tuple (A ,u> ) for the network coding problem (G^, M) where 



A s = A s , Vs G S 
oj e = oj c , Ve G £ \ g 



E < 

/ein(e) 



J [f,e] 



Ve G g. 



(132) 
(133) 
(134) 
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Proof: By direct verification. ■ 
The construction of together with relationships between X/lo and A'/u/ remove the partial routing constraint 
by making the capacity of the links e <E g in sufficiently large such that network coding is never required at 
head(e), which has sufficient capacity to simply forward all of the incoming messages. Also note that the choice 
of Wy. e j are free, apart from the constraints ( |134) , 



As a corollary of Theorem 5.3 all of the results obtained in the earlier sections can also be applied to network 



coding problems with partial routing constraints. 

VI. Secure Network Codes 

So far we have considered two classes of constraints on network codes. In Section |IV] we considered linear 
network codes, which may be attractive for practical implementation. In Section [V] we considered networks where 
some, or all of the nodes are constrained to perform only store-and-forward types of operations. Another important 
class of constraints to consider for network coding are motivated by security considerations. The objective is to 
determine the achievable network coding rates when we require secret transmission that is impervious to specified 
eavesdropping attacks. 

Assume that there are \1Z\ adversaries in the network. Adversary r € 1Z observes messages transmitted along links 
in the set B r C £ and aims to reconstruct the set of sources indexed by A r C S. We refer to W={(_4,., B r ), r £ 1Z} 
as the wiretapping pattern of the network. 

We will use the notation P = (G, M, W) to denote the network coding problem subject to a secrecy constraint, 
also referred to as a secure network coding problem. Here, G and M are as usual the network topology and the 
connection constraint. The secure communications objective is to transmit information over a network satisfying 
the multicast requirements while simultaneously ensuring that the eavesdroppers gain no information about their 
desired sources. 

Before we characterise the set of O-achievable/e-achievable rate-capacity tuples for secure network coding, we 
need to point out a significant difference between ordinary and secure network codes. Without a secrecy constraint, 
the transmitted message on any network link e can be assumed without loss of generality to be a function of the 
source inputs and received messages available at the node tail(e). However, when secrecy constraints are enforced, 
it is usually necessary to encode messages stochastically to prevent an eavesdropper from learning any useful 
information about its desired sources. 

Definition 24 (Stochastic network code): A stochastic network code is defined by a set of random variables 

{Y f ,f eSUS U V} (135) 
with entropy function h such that Y s is uniformly distributed over its alphabet set for all s G S and 

/ieCi(P)nC T (P) 

where 
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, NA ft £ H[SU£ U V] : 
G(P)=4 } (136) 

I ft(«s,v) = £, g5 fc(«) + £ ue vM«) 

, N A \h £ H[SU£U V] : ] 
Ct(P)^{ }. (137) 

| h(s | in(e),tail(e)) = 0,Ve e £ J 

In the definition, {Y s , s £ S} and {Y e , e £ £} are again the set of random sources (indexed by s £ S) and the 
set of messages (transmitted on hyperedges e £ £). The random variables {Y U) u £ V} can be thought of as the 
random keys available at nodes «eV for stochastic encoding. Specifically, each link e £ £ is associated with a 
local encoding function such that 

Fe = ^ e (yi,iein(e),y tail(e) ). (138) 

Clearly, we have 

H(Y e | Y u ie in(e),y tai , (e) ) 

and hence ( |137[ ). Furthermore, we want to point out that the random keys {Y u , u £ V} are not like the usual secret 
keys that are privately shared between nodes in Shannon-style secure communications. Instead, they are locally 
(and hence independently) generated at each node. In other words, there are no correlated or common keys shared 
privately between nodes in advance. Therefore, we will assume that {Yf, f £ S U V} are mutually independent, 
and require ( |136| l to hold. 

A. Weak Secrecy 

Definition 25 (Weak secrecy): For a secure network coding problem P = (G, M, W), a rate-capacity tuple (X,u) 
is called O-achievable subject to a weak secrecy constraint if there exists a sequence of stochastic network codes 

$" = {Yp : f £ £ U S U V} 

and positive normalising constants c„ such that 
(SI) for all e £ £ and s £ S, 

\im c n \og\SP(Y c n )\<uj(e), (139) 

n—too 

lim Cn log|SP(F s ")| > X(s). (140) 



(S2) for s £ S and u £ D(s), 



(S3) For all r £ TZ, 



H(Y a n \Y?, /ein(u)) = 0. 



lim c n I(YX r ;Yg r )=0. (141) 
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Similarly, a rate capacity tuple (A, to) is called e-achievable subject to a weak secrecy constraint if there exists 
a sequence of network codes 

$" = {Yf ; f g £ U 5 U V} 

satisfying (SI) and (S3) and the following condition (S2'): 

(S2') for s E S and u E D(s), there exists decoding functions g™ u such that 

lim Pr(Y s n ? glJXf : / g in(«))) = 0. 

The following theorem can be proved by using the same technique as in Corollary |2.1| For brevity, we state the 
theorem without proof. 

Theorem 6.1 (Outer bound): Consider any secure network coding problem P = (G,M,W) subject to a weak 
secrecy constraint. Let 

A \h E H[SU£UV] :h(s\ in(it)) = 0, 1 
C D (P)= < > , (142) 

[ Vs g S, u g D{s) J 

C s (P)={h E H[S U£(JV] : h (A r A B r ) = 0,Vr g K} . (143) 
If a rate-capacity tuple (A,w) g x(P) is e-achievable, then there exists 

ft g f *(5 u £ u v) n Ci(P) n c T (P) n C D (P) n C S (P) 

such that 

A(s) < /i(s), Vs g 5, (144) 
w(e) > h(e), Ve g £. (145) 

Or equivalently, 

(A,w) g CL(proj P [f*(5u£:u v)nCi(P) nc T (P) nc D (P)nc s (P)]). 

The condition ([142) corresponds to the decoding constraint, requiring that any node u E D(s) can decode the 
source s with vanishingly small error. The condition ( |143[ ) is the secrecy constraint, ensuring that an adversary can 
learn no information about the sources it is interested in. 

Unlike in Theorem 1 3. 1| we do not claim tightness of the outer bound even for colocated sources. This is because 
secure network nodes may locally generate random keys for the purpose of stochastic encoding. These keys, to a 
certain extent, behave like sources (with no corresponding sink nodes), and hence the colocated source condition 
fails, even if the actual sources are colocated. 



B. Strong Secrecy 

Weak secrecy requires that the amount of information leakage vanishes asymptotically after normalisation. In 
other words, the amount of information leakage is negligible (when compared with the size of the source messages). 
We can also consider a strong secrecy constraint, where we require the information leakage to be exactly zero. 
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Definition 26 (Strong secrecy): A rate-capacity tuple (X,oj) is O-achievable subject to a strong secrecy constraint 
if there exists a sequence of network codes 

$« = {Y} 1 :f e £ U S U V} 

and positive normalising constants c„ satisfying (SI), (S2) and the following condition 
(S3') I{Y% r \Yg r ) = for all n and reK. 

Similarly, it is e-achievable subject to a strong secrecy constraint if the sequence of codes satisfies (SI), (S2') and 
(S3'). 

In general, it is very hard to characterise the set of achievable rate-capacity tuples subject to a strong secrecy 
constraint, even implicitly via entropy functions. However, under the additional constraint of linearity, the set of 
O-achievable rate-capacity tuples can in fact be characterised implicitly via the use of representable functions. 

Definition 27 (Strongly secure linear network codes): Let 

{Yf.f eSUSUV} (146) 



be a stochastic network code (according to Definition 24 1 for a secure network coding problem P on a network 
G = (V,£). The code is called q-linear (or simply linear) if it satisfies the following conditions: 

1) For / G SUV, Yf is a random row vector such that each of its entries is selected independently and uniformly 
over GF(q). 

2) For any e £ £ , there exists a linear function <fi e such that 

Ye = 4> e (Ym(e),Yta\\{e))- 

A network coding problem is said to be subject to a q-linearity constraint if only q-linear network codes are allowed. 

Theorem 6.2 (Strongly secure linear network codes): Consider a secure network coding problem P where |0(s)| = 
1 for all s 6 S. A rate-capacity tuple (A,w) is O-achievable subject to q-linearity and strong secrecy if and only if 

(A,w) € CL(proj P [T*nCine T nC D nCs]). 

Proof: We first prove the only-if part. Suppose (A, uS) is O-achievable subject to linearity and strong secrecy 
constraints. By definition, there exists a sequence of linear codes 

{Ypj esueuv} 

with entropy function h n and normalising constants c n > such that (SI), (S2) and (S3') hold. By (S2) and (S3'), 

h n e T;nCinc T nc D nc s . 

And hence, c n h n e T* n C\ n Cj n C D n Cs- Consequently, 

(A,cj) e CL(projp[T*nCinc T nc D nCs]). 

and the only-if part follows. 
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Now let 

h et*r\Cinc T nc D nc s . 

To prove the if '-part, it suffices to prove that proj p [ft] is O-achievable. As in the proof of Theorem 4.1 there exists 
a sequence of g-representable functions 

h n e T* n C T n C D n d (147) 

and positive scalars c n such that 

ft = lim c n h n . (148) 
For each n, ft™ induces a zero-error linear network code 

ft n ,ie5u£UV} (149) 

such that 

HQY) = h n (a),VseS (150) 
H(Y?) = h n (e),Vee£. (151) 

However, the linear network code ( |149[ > need not be strongly secure (i.e., ft™ <E Cs). In the following, we will create 
from h n another representable function g n such that 

lim c n g n = lim c n h n = ft (152) 

n— >oo n— >-oo 

5 ™ g T*nCinc T nc D nc s . (153) 

Since ft™ is g-representable, there exists subspaces 

{V",i e SU£U V} 

such that for all a C 5 U £ U V, 

ft™ (a) = dim(V; l ,j G a). 

For each r G 72. and s G >4 r , we define 

W;%=V" n (V", / G Br) . (154) 

Then by direct verification, 

ff(W" s ) = ft"(sA6 r ) 

and hence lim n _>oo c„i7 (W" s ) = 0. 
Let 

WJ=(W n s) r G ft and s G A). 
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By Lemma 4. 1 for every s 6 S, there exists a subspace U™ of V™ such that 

dimV; 1 = dimU'; + dirnW™ (155) 

{0} = U"nW;\ (156) 
For any s G S, let O(s) be the unique source node where the s th source is available. Let 

K = (K^ , :,0(s)=u), VueV (157) 



On the other hand, 



As 



we have 



Since /i"eC| : 



As U s C V s for seS, 



U' e l =V J e \ Vee£. (158) 



U" n (V 1 ;, / e Br) c v 1 ; n (W], / e B r ) (159) 

= W" S . (160) 

u?nwp )S = {o}, 
o:n(VJ,/eB r > = {0}. 

dim(V s ,s e 5) = ^dimV s . (161) 

dim <U^, s € S) = ^ dim U£ . (162) 



se5 

Let g n be the representable function induced by 

{U n f ,f £ SU£U V}. 

Then, it can be directly verified that 

1) g n € T* n C| n C T n C D n C s where ,g n is the rank function induced by {YJ 1 , f G 5 U £ U V}, and 

2) linin^oo c„.g" = h. 

Consequently, projp[5 n ], and also proj p [c„g"] and proj p [/i], are 0-achievable subject to the two constraints. ■ 
C. Secret Sharing 

In secret sharing [18], a secret is shared among a set of users JV where each user holds a component of the secret. 
The main objective is to ensure that only specified legitimate subgroups of users (indexed by a subset A of AO 
can successfully decode the secret. All other illegitimate subgroups of users should receive no information about 
the secret. The collection of all legitimate subsets il is called the access structure of the secret sharing problem. 
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We can reformulate a secret sharing problem as a secure network coding problem P = (G, M, W). In this secure 
network coding problem, there is only one source (the secret) which is only available at the source node u* . There 
are \Af\ intermediate nodes, each of which represents a user. The transmitted message an intermediate node (or a 
user) received from the source corresponds to the component of the secret that it holds. There are |0| sink nodes 
indexed by {v a , a € fl}. The sink node v a is connected to nodes (or users) i £ a and aims to reconstruct the secret. 
We also assume that each /3 g" fi, is associated with an eavesdropper who can wiretap the set of edges {ei,i € /?}. 
The secrecy constraint implies that all illegitimate subgroups of users have no information about the secret. 

Mathematically, the secure network coding problem is defined as follows. 

1) G = (V,£) where V = {u*} UjVU {v a ,a G fi} and £ = {e h G AT}; 

2) for any i £ AT, tail (e^) = u*, head(ei) = {i}, tail(/j) = i and head(/i) = {v a : i £ a}; 

3) M = (S, O, D) where (i) S = {1}, (ii) 0(1) = {u*} and (Hi) D(l) = {v a ,a € fl}; 

4) W = {(1, ei,i £ ft) : (3 C TV and & Q}. 

By translating a secret sharing problem to a secure network coding problem, the results obtained in this paper can 
be applied to secret sharing. 

VII. Challenges in characterising achievable tuples 

Characterising the set of achievable rate-capacity tuples for a network coding problem is generally very hard. So 
far, there are only a limited number of scenarios where the sets of O-achievable/e-achievable rate-capacity tuples 
have been explicitly determined. One scenario is when there is only one source, |<S| = 1 and no partial routing 
constraint or secrecy constraints. In this case, the set of achievable rate-capacity tuples is explicitly characterised 
by the cut-set bound Ifl9l . If a secrecy constraint is additionally imposed, the set of achievable tuples can still be 
determined if ( i) all links have unit capacity and ( ii) the eavesdropper is t-uniform in the sense that an eavesdropper 
can wiretap any t links in the network [20|, f6), ETI . In both cases, linear codes are optimal. 

It is natural to wonder whether there is any hope that wide classes of network coding problems could have simple, 
explicit characterisations. In the following two subsections, we will show that even in some very simple scenarios, 
finding the set of achievable rate-capacity tuples can be extremely hard. The first scenario will be an incremental 
multicast. The second scenario is a secure multicast. 

A. Incremental Multicast 

In a incremental multicast problem, sources are totally ordered such that a receiver who wants to reconstruct 
source s is also required to reconstruct all other sources i for i < s. Here, the symbol < is defined with respect 
to the total ordering of the sources. Incremental multicast is common in multimedia transmission, where data such 
as video or audio may be encoded into multiple layers. A layer can only be used for reconstruction at a receiver 
if all its previous layers are also available. This leads directly to an incremental multicast problem. 

We will construct the simplest case of a incremental multicast problem involving two layers, colocated at the same 
source node. Hence, there are two types of receivers: those which request source 1', and those which request both 
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sources (1' and 2')- Even for such a simple setup, we will show that determining the set of achievable rate-capacity 
tuples can be as hard as solving any network coding problem in general. 

To prove our claim, we consider any ordinary network coding problem P = (G, M), where sources may or may 
not be colocated. We will transform P into a two-layer incremental multicast problem Pt = (GT, M^) and prove 
that determining the set of achievable tuples in the incremental multicast problem Pt is at least as 



in Theorem 



7.1 



hard as determining the outer bound of Corollary 2.1 for the problem P. 

The network G^ = (V^,£^) is obtained from its subgraph G by adding nodes and hyperedges 

V f =V U { 7s , r,, u , s&S,ue D(a)} U {V>, 0, r h rj*} 
£ ] =E U {a s , b s , c s , seS}U {d s . u , s£S,ue D{s)} 



with connections 



tail(a s 
head(a s 

tail(b s 
head(b s 

tail(c s 
head(c s 
tail(d s ,„ 
head(d SjU 



= 0( S )U{ ls ,if}. 



{-Ys,V^}u\J{Tj,v,veD(j)} 



7s 



{r),r)*}u{T s , v ,v e D(s)} 



(163) 
(164) 
(165) 
(166) 

(167) 
(168) 
(169) 
(170) 



for all s G S,u e D(s). 

In addition to augmenting the network, we also need to define the connection requirement = (S* ,0* ,D*). 
In our two-layer incremental multicast problem there are two sources indexed by 



St4{i',2'}. 



All the sources are colocated at the node d>, i.e., 



0^(1) =O f (2') = 0. 
Finally, the destination location mapping is defined as 

Dt(l') ={T s , u ,seS,u£D{s)}U{r),r)*,i;}, 
rf(2)={r 1 ,r,*}. 

Figure |4] exemplifies how to convert an ordinary network coding problem P (which has two sources) into a two 
layers incremental multicast problem. Here, a source will be denoted by a double circle, and a sink by an open 
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square. The label beside a source or a sink denotes the index of the sources which are available or are required at 
the node. Note that in the figure, the sink nodes u and u' in the original problem P are no longer sink nodes in 
the incremental multicast problem P^. Any rate-capacity tuple (A, u>) € x(P) f° r P induces another rate-capacity 




(a) Network G 
Figure 4. Transformation from G to G^. 




(b) Network G+ 



tuple 

(At jW t)Art( AjW ) 

in x( pt ) f°i" pt such that 

w f (e) = u(e), 
J(a s )=J{b s )=^{c s ) = X(s), 
^( d s,u) = A(s), 
At(l') = At(2') = ^A( S ). 

where e G £, s € S and u G D(s). 

Theorem 7.1: Let (A,cj) be a rate-capacity tuple in x(P). Then the following two claims are valid. 

1) If Tt(A,w) G CL(projp t [f*(Pt)nC|(Pt)nCT(P t )nC D (P t )]), then 

(A,w) g CL(proj P [f*(P)nc l (P)nc T (P)nc D (P)]). 

2) If a rate-capacity tuple (X,oj) for P is O-achievable, then T^(X,uj) is O-achievable with respect to P^. 
Proof: See Appendix [B] 



(171) 
(172) 
(173) 
(174) 



Using a similar arguments as in Theorem 7.1 we can also prove the following theorem, whose proofs we omit 
for brevity. 
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Theorem 7.2: Let (A, w) be a rate-capacity tuple in x(P). Then the following two claims are valid. 

1) If Tt(A, w) € CL(proj pt [T*(Pt) n Q(Pt) n Cj(P^) n C D (Pt)]), then 

(A,w) e CL(proj P [T;(P) n Ci(P) n c T (P) n c D (P)]). 

2) If (A, a;) is O-achievable with respect to P subject to the g-linearity constraint, then T'(X,u>) is also 0- 
achievable with respect to P^, subject to the g-linearity constraint. 

Corollary 7.1 (Colocated sources): Suppose all the sources are colocated in P. Then 

1) (A, cj) is O-achievable with respect to P if and only if T^(X,u>) is also O-achievable with respect to PL 

2) (A,w) is O-achievable with respect to P subject to the q-linearity constraint if and only if T'(X, ui) is also 
O-achievable with respect to pt subject to the same linearity constraint. 



Proof: A direct consequence of Theorems 3.1 4.1| |7.1|and 7.2 



In El, a specific network coding problem P was proposed, such that all sources are colocated and that determining 
the set of e-achievable rate-capacity tuple is at least as hard as determining the set of all information inequalities. 
Furthermore, it was also proved that linear codes are not optimaj^] Therefore, by Corollary |7.l| we can directly 
prove the following proposition. 

Proposition 7.1: There exists a two-layer incremental multicast network coding problem P^ such that 

1) Characterising the set of achievable rate-capacity tuples for a two-layer incremental network is in general no 
simpler than determining the set of all information inequalities. 

2) Linear codes are not optimal. 



B. Secure Multicast 

In this subsection, we consider another scenario, very simple secure network coding problem with only one 
source. We will again show that solving the resulting secure network coding problem can be as hard as solving 
a general multi-source unconstrained network coding problem. Our approach is essentially the same as that used 
in the previous subsection for the incremental multicast. We will construct a simple single-source secure network 
coding problem P* = (G*, M*, W) from an ordinary network coding problem P = (G, M). We will then show 
that solving the so-constructed secure network coding problem is as hard as solving the original network coding 
problem. 

Construct the network G^ in P* by adding nodes and hyperedges 

V*=V U ty>„7„ 0, ltt , Ti.u, s £ S, u € D(s)} U {0, 77} 
£$=£ u {a Sl b Sl c s ,e s ,s e S} U {d s . Ul w SiU , s G S,u € D(s)} 

5 Linear codes are not optimal in the sense that there exists a O-achievable rate-capacity tuple (A,u) for P which is not achievable when 
subject to the additional linearity constraint. 
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tail(et s ) = <t> 
head(a s ) = O(s) U {77} U { 7s } 

tail(6 a ) = 4> 
head(6 s ) = { 7s ,r ? }u[J{^ i ,iGZ>(i)} 

tail(c s ) = 7,, 
head(c s ) = {ip s } 
tail(d s , u ) = u 
head(d s ^) = t s>u 
tail(e s ) = 4> 
head(e s ) = {ip s } U {r S)i , i € £>(s)}. 

for all s G G -D(s). 

The connection requirement = (S*, O*, .D*) is defined as follows: 

0{l)H 

D(1)±{t s>u , seS,ue D(s)} U {V> s , s G 5} U {77} 
W^{(^ 1) B 1 ),(^ 2 ,B 2 )} 

where 

Ai=A 2 = 1' 
Si = {a s ,s G S} 
B 2 = {b s ,seS}. 

Figure [5] exemplifies how to convert an unconstrained network coding problem P into a single-source secure 
network coding problem. 

As before, for any rate-capacity tuple (A, ui) G x(P), we 

define a tuple (A 1 , ufi)=T^{\oj) £ xi? 1 ) as follows: 



w*(e)=u;(e), (175) 

w*(a.) = w*(&,) =« t (c) = A(s), (176) 

w*(d.,„) = A(s), (177) 

w*(e,)= A W ^ 178 > 

ie5\{s} 

A*(l') (179) 

s£S 
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(a) Network G (b) Network G* 

Figure 5. Transformation from G to G*. 



for all e € £, s E S and u G D(s). 

Theorem 7.3: Let (A, uS) be a rate-capacity tuple in x(P). Then the following two claims are valid. 

1) If T x (\,u) G CL(proj(^)) for some 

fc* e f*(P :t )nCi(P :t )nCT(P 1: )nCD(P 1: )nCs(P 1: ). 

then 

(A,w) € CL(proj(f*(P)nC,(P)nC T (P)nC D (P))). 

2) If a rate-capacity tuple (A,w) for P is O-achievable, then T*(X,u>) is O-achievable with respect to P* subject 
to the strong secrecy constraint. 

Proof: See Appendix [C] ■ 
The following theorem is the counterpart of Theorem |7.2| Again, its proof will be omitted. 
Theorem 7.4: Let (A,w) be a rate-capacity tuple in x(P). Then the following two claims are valid. 

1) If T*(A,w) e CL(proj p} [f*(Pt)nC|(Pt)nC T (P :t )nC D (Pt)nCs(P :t )]), then 

(A, w) G CL(proj P [T;(P) n C,(P) n c t (p) n C D (P)]). 

2) If (A, a;) is O-achievable with respect to P subject to the g-linearity constraint, then T+(A, w) is also 0- 
achievable with respect to P+, subject to the strong secrecy and g-linearity constraint. 

Corollary 7.2 (Counterpart of Corollary \7.1\ : Suppose all the sources are colocated in P. Then 
1) (A, ui) is O-achievable with respect to P if and only if T*(A, uS) is also O-achievable with respect to P+ subject 
to the strong secrecy constraint. 
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2) (A, cj) is O-achievable with respect to P subject to the q-linearity constraint if and only if T*(X, (J) is also 

O-achievable with respect to P* subject to the strong secrecy and g-linearity constraint. 
Proposition 7.2 (Counterpart of Proposition [ZT) : There exists a single source secure multicast network coding 
problem P* such that 

1) Characterising the set of achievable rate-capacity tuples for P+ is in general no simpler than determining the 
set of all information inequalities. 

2) Linear codes may not be optimal. 

Remark: In [6 1, it was proved that in the single-source case, if all links have equal capacity and the eavesdroppers' 
capability is limited by the total number of links it can wiretap, then linear network codes are optimal. Therefore, 
Proposition |7.2| is indeed a surprising result proving that linear network codes are not optimal in general. 

VIII. Conclusion 

Characterisation of the set of zero-error or vanishing-error achievable rate-capacity tuples for network coding is 
a fundamental problem in multiterminal information theory. In J4|, it was proved that this characterisation problem 
is extremely difficult in general and is as hard as determining the set of all information inequalities. This goes some 
way toward explaining why the problem has so far been solved only for a few special cases. 

The authors in JT| and fl2] used entropy functions to implicitly characterise the set of achievable rate-capacity 
tuples for general networks. Although this characterisation is implicit, it offers insights about the structure of the 
set of achievable tuples. For example, knowing that the set of almost entropic functions T* is not polyhedral, J4| 
proved that the set of achievable tuples also is not polyhedral in general. 

This paper extended |[T) and (2) in several aspects. First, we proved that when sources are colocated, the outer 
bound given in [2, Section 15.5] is indeed tight. In particular, we showed that the set of rate-capacity tuples 
achievable with vanishing error, and the set achievable with zero error are indeed the same. We also gave evidence 
to support our conjecture that the outer bound in [2, Section 15.5] remains tight even when sources are not colocated. 

Secondly, we considered network coding problems subject to several practically-motivated constraints, such as 
linear coding, the restriction of some or all nodes to perform only routing, and security requirements. For these 
cases we characterised the set of zero-error and vanishing-error achievable rate-capacity tuples. Finally in Section 



VII we proved that even for very simple network coding problems, such as the incremental multicast problem and 
the single source secure network coding problem with arbitrary wiretapping patterns, characterisation of achievable 
tuples is as hard as the characterisation problem for general unconstrained network coding. We also proved that 
linear codes are suboptimal for both the general incremental multicast problem and for the single source secure 
network coding problem. 
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Appendix A 
Proof of Proposition s. 31 

Consider the following combinatorial problem. Suppose that there are k boxes, t of which are nonempty. If we 
randomly select m distinct boxes, then the probability that all selected boxes are empty is upper bounded by 

Pr(all m boxes are empty) < I 1 — — J (180) 

Let k(c) = (1 — c) x / c . Since lim c _j. + K ( c ) = exp(— 1), there exists < 5 < 1 such that k(c) < S for all 
< c < 1. Hence, (1 1 80b can be relaxed as 



Pr(all m boxes are empty) < S tm/k . (181) 

Let (U,V) be a pair of quasi-uniform random variables. As V is uniform over its support, |SP(V)| = 2 H ( y \ 
Let 

oH(V) 

m = H(UV) 2 wwm - 

Partition the set SP(V) randomly into 

oH(V\U) 

2 H W/m - 5 

H{UV) 2 

subsets, each of which is of size m. These disjoint subsets will be denoted by 5(6) where 

b£AvHl,...,2 H(V]U) /H(UV) 2 }. 
For any u £ SP(U) and b £ Ay, let E(u, b) be the event that 

{(«,*) : * £ S(6)}nSP(J7,y) ^ 0. 

In other words, the event is equivalent to the existence of an element i £ 5(6) such that (it, i) £ SP(?7, V). 

In the following, we will prove that the probability of E(it, b) is "arbitrarily close to one asymptotically" for all 
u £ SP(U) and b £ Ay- 

For any u £ SP(U), it is easy to see that 

|{« : (u, w) £ SP{U, V)}\ = 2 H( y\ u \ (182) 

By setting k = 2 H ^> and t = 2 H ^ U K ([T|TJ implies that 

Pr(E(u, 6)) > 1 - 5 H{UV ? (183) 

and hence via the union bound, the probability that the event E(it, b) occurs for all u £ SP(U) and b £ Av is at 
least 

1 _ 2 H(U)+H(V) S H(UV) 2 _ 

This probability approaches to as H (UV) goes to infinity. Consequently, if the entropy H (U) (and hence also 
H (UV)) is sufficiently large, there exists a way to partition SP(V^) such that for any u £ SP(U) and b £ Ay, 
there exists at least one v £ 5(6) such that (u,v) £ SP(U,V). 
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Assume without loss of generality that S = {1, . . . , |<S|}. Repeating the same argument, we can recursively prove 
that for any set of quasi-uniform random variables {[/,-, i E S} and H (U\) sufficiently large, there exists at least 
a way to partition SP(J7 S ) into 

2*Wtfi.-.tfi-0/fl- (U u ..., Uif 

subsets 2 S (6 S ) where 

b s G AA{1, 2 H ^ U — U ^/H (U 1: . . . , Us) 2 } 

such that for any [u±, . . . , it s -i) G SP(C/i, . . . , f/ s -i) and & s G *4 S , there exists at least one u s G S S (6 S ) such that 
(wi, . . . , u s ) G SP(C/i, . . . , U s ), Hence, the proposition is proved. 

Appendix B 
Proof of TheoremI7.1I 



We first prove the first claim. Suppose 

(At, w t ) A T t (A ^ )eCL(projpt[/l t ]) 

for some 

Then by definition, 

\Hi)<rf(i), i = l,2 (184) 
J(a s ) > rf(a s ), VseS (185) 
^(b s ) > h*(b a ), VseS (186) 
u\c s ) > /i f (c s ), VsG<S (187) 
w f (e) > fct(e), Veef. (188) 
Consequently, by {m)-([T74) and ([l85)-([T87j, 

2 E A ^ = +^(6.)) ^ + > hHa s ,b s , S e S) (189) 

where (z) follows from the fact that fv £ r*(Pt) (and hence is polymatroidal). Similarly, we can also prove that 
2 E A ( fi ) = Z> f (a.) + wt ( c -)) ^ E^^) + h ^ c ^ ^ ^(a s ,c s , s G 5). (190) 
Let g be the "projection" of W on H[5U £] such that for any a C £ and /3 C S, 

g{a,P)=rf{a,ai,ie j3). (191) 

In the following, we will prove that 

(A,w) G CL(proj p [ 5 ]) 
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and 

. 9 ef*(P)nCi(P)nc T (P)nc D (P). 

First, tv G r*(P^). Hence, its projection g is also in T*(P). Second, the network G^ contains G as a subnetwork 
and O(s) C head(a s ). In other words, if a node u has access to the source s in the network coding problem P, then 
u also has access to what is being transmitted along the link a s in P^. The link a s in G^ is thus like an imaginary 
source link in G. It can then be verified directly that g E Cj(P). 

Now, we will prove that g 6 C|(P) nC D (P). As 

fet € f*(P t )nc D (P t )nc T (P t ), 

and that the set of links {a s , b s , s € 5} separates the source node <j> from the sink node 77 in G^, 

ftt(l', 2'\a s , b s , s 6 5) = tf(a„ b„ s G S\l', 2') = 0. (192) 

Consequently, 

tf(a s ,b 3 ,s€S)>h\l',2') (193) 

( i/it(i') + ^t(2') (194) 

(ii) 

> A t (l) + A t (2) (195) 

= 2]TA( S ) (196) 

where (i) follows from the fact that G r*(Pt) nC|(P t ) and (ii) follows from ( |184| ). 

Similarly, the set of links {a s , c s , s G 5} separates the source node from the sink node 77* in G^. Hence, 

/it(a s ,c s ,s6 5)>2^A(s). (197) 
Therefore, all the inequalities in (|189[) and (|190|) are in fact equalities. In particular, 



tf(a a ) = tf(b B ) = tf(c s ) = \(8), Vse5 (198) 

and 

^(a s ,6 s , S 6 5) = Ysi^M + hHkj), (199) 
/it(a S! c S! s 6 5) = £(tf(a.) + ^ f (c s )). (200) 
By ( fT99) , ht( aa)S g 5) = Eaes^C *)- Hence ' 

and g 6 C\(P). Furthermore, as 

g(s) = h\a s ) = A(s), Vs 6 5 (201) 
g(e) = h\e) < w(e), Ve € £, (202) 
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we prove that 

(A,w) G CL(proj P [.g]). 

Now, it remains to show that g £ Cd(P)- First, consider any s £ S and u £ D(s). By ( |199| l-( |2"0"0"l >, 

/i f (b s \ s A a s ,b s ) = 0. 
As h) e f*(Pt)nCT(P f ), tf(c s | a s ,b s ) = ht(\n(u) | o s ) = 0. Hence, 

^fev Aa 5 ,6 s ,c s ,in(u)) = 0. (203) 
Together with the decoding constraint (for the receiver r Si „) 

rf(a„ I &s\ s ,c s , in(u)) = 0, 

we can prove that 

h'(a s | c s , in(tt)) = 0. 

Finally, using ( g00) and that /jt(in(w) | a 5 ) = 0, we have /i' (a s | in(u)) = 0. Thus, 

g(a s | in(tt)) = ftt(a a | in(u)) = 

and g £ Cd(P)- The first claim is proved. 

To prove the second claim, suppose (A, oS) is 0-achievable with respect to P. By definition, there exists a sequence 
of zero-error network codes 

{YfJeSUS} 
for P, and a sequence of positive constants c n such that 

lim c n H(Y") < lim c n H\SP{Y?)\ < u(e) (204) 

n— too n— foo 

lim CnHCyp) = lim c n H\SP(Y™)\ > A(s). (205) 

n— too n— >oo 

Assume without loss of generality that 

sp(r s ") = {o,...,|SP(r; i )|-i}. 

For each n, define a new set of random variables 

{f/?,/e«Stu£t} 

such that for any s £ S and u £ D(s), 

1) [7£4y» ; 

2) [/™=y e n ; 

3) {l^ 1 , s G 5} is a set of mutually independent random variables such that each of which is uniformly distributed 
over {0,...,|SP(Y s n )| - 1} and 

fes 
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4) Ul=U™ s +Ul mod | Y? |; 

5) Ul^Ul; 

6) U?A(U? f ,f€S); 

7) u^A(us f ,feS). 

It can then be proved directly that {Uf, f E U £^} is a sequence of zero-error network codes for the network 
coding problem P^. Consequently, T(X,lu) is O-achievable with respect to P^. The theorem is proved. 

Appendix C 
Proof of TheoremI7.3I 



We first prove the first claim. Let 

(A*,wt)4Tt(A,w) G CL(proj pt (/**)) (206) 

for some 

h* Gr*(P t )nC[(P t )nc T (P t )nCD(P t )nc s (P 1: ). (207) 

By (fT73)-p06), for all e G £, s G S and u G D(s), 

^A(i) = Ai(l')</i*(l') (208) 

A(s) =w*(a„) > ft*(a a ) (209) 

A(«) (210) 

A(«)=w*(c,)>/i t (c a ) (211) 

A(«)=w*(d Sltl ) (212) 

\(s)=ui(w s , u )>hi(w S!U ) (213) 

^ A(i)=w*(e.)>^(e.) (214) 

cj(e)=u t (e)>h i (e). (215) 

Using (f208j-(|2T5j, we have 

> = X>*( «) > X>*K) > (216) 
where (£) follows from that ft* 6 r*(P*") and hence is a polymatroid. Similarly, 

^(l')> 53 A(s) =53^(6.) (217) 

sG5 s£»S sG5 

Recall that h* is a rank function in the space W[S$ U £* U V*]. Let 5 be its "projection" on H[<S U £) such that 
for any a C £ and /3 C S, 

g{a, 0)±h*(a, a u i G /3 | V 1 \ {0}). (218) 
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In the following, we will prove that 

(A,w) G CL(proj P [si]) 

and 

g er*(P)nCi(P)ne T (P)nc D (P). 

First, as £ T*(P$), it is obvious that 

$ef*(P). 

Second, notice that the network G* contains G as a subnetwork. Therefore, by h* £ T*(P$) r\Cj(P^), 

9 G C T (P). 

Now, as /i 1 £ C|(P*) nCs(P*), we have 

/i*(l',V*) = /i*(l')+ ft*(u) (219) 

ft* (Y Ao 5 )= (220) 

h x (l A & s ) = 0. (221) 

By ( |219[ ), we can deduce that 

h} (l'^AV* \{0}) ( 222 ) 
Due to the topology constraint (for the links {a s , 6 S , s £ S}), 

hi(a s ,b s \l,<j>)=0. (223) 

Hence, 

fc* (05,65, l' AV*\{0}) =0 (224) 
On the other hand, the decoding constraint hs £ Cd(P) (for the sink node rf), we have 

I 05, 65) =0. (225) 
Together with ( |220| >-( p2T| ), (and with the fact that h* is a polymatroid) ( |225[ ) implies that 

ft* (as) > (226) 

> h\l). (227) 
By the upper bounds on h^(ag) and h^(bg) in ( |216[ )-( |2"T7] i, we can in fact prove that 

h} {a s A b s ) = (228) 

^(05,65) = X>*(0 + /l *(M)- (229) 
se5 

ft* {a s , s e 5) = ^ i a * ) (230) 

ht(a s ) = h t (b s ) = \(s), Vs£S. (231) 
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Now, for any s € S, 

g{s) = h t {a s |V*\{0}) 
( = } ^(a s ) 
= A( S ) 

where (i) follows from ( |224| i. Also, for any e £ £, 

g(e) = fct(e|V*\ {<£}) 

< ^(e) 

< w(e). 

Therefore, (A, w) £ CL(proj p [g]). 
By definition, 

ff (S) = h*(as I V* \ {0}) ® ft* (as) = > 5>*(a s | V* \ {0}) = > g(S) (232) 

where (i) is due to ([224). Thus, g e C\(P). 
Our last step is to prove that g E Cd(P)- As 



h t (c s ) + h t (e s )<J2 X ^ = ht ( 1 '^ 
the decoding constraint h$(l \ c s , e s ) = (for the receiver ip s ) implies that 

h t (c s \l')= (233) 

and 

h i (c s ) = X(s) (234) 

By (|220l)-(p2T) and ([229) 

/i*(c a A a s ) = h}(c s A fo s ) = h l (a s Ab s ) = (235) 

On the other hand, by ( |2T9] >, ft* ^ A l', ^ =0. By ( |223] i and ( |233] l, we have ft* (7^ A a s ,b s ,c s ) = 0. Together 
with one of the topology constraint (for the link c s ) 

h}{c s I a s ,b s ,j s ) = 0, (236) 

we have 

fc*(c, |a„6,)= 0. (237) 

By ( |234| i and ( |231| l, we can prove that 

ft^Cs I a s ,6 s ) = ^(ag I c s ,6 s ) = ^(bs \ a s ,c s ) = 0. (238) 
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Similarly, focusing on the receiver t s u , the decoding constraint h ^1 | w StU1 e s ^j = and that h^(w s u ) < A(s) 
imply that 

hHw a , u | l') = (239) 

and 

h i {w s<u ) = X(s). (240) 
By p2l) , h x {w s . u A b s ) = 0. On the other hand, by ([224) and ( |2"20") 

h\l A as, V* \ {</>}) = , as A V* \ {0}) = 0. (241) 
Furthermore, by the topology constraint 

h i (d s , u \a s ,V i \{4>})=0 (242) 

Therefore, together with ( |239| l, we have 

, w StU A d 8jU , as) = 0. (243) 

Similarly, by < [22"3) and ( [228) , 

h*(6 5 Aa 5 ,V*\{0}) = ft*(a s ,& s A V*\ {</>}) = (244) 

and hence 

h t (b s /\d. lU ,a 8 ) = Q. (245) 

Consequently, we have 

/i* (iu S)U A 6 a ) = /i*(iy s ,„ A = ^( d a,u A 6 S ) = 0. (246) 
On the other hand, by the topology constraint, 

h*(w„ tU | b s ,d s , u ,e S}U ) = 0. (247) 
Again, by ([223), fo* M.', 6 S , d S) „ A S) „) = 0. Then, by ( p47) , we can prove that 

ft t K,u|&.,d.,«)=0. (248) 
Using ( |24"0") , ([246) and ( |248) and ( |2T2) and p3~T) , we can prove that 

^(ws.u | & s ,4,u) = to*(ds,u I & s ,w s ,u) = ^Os I w s ,ud s ,u) = (249) 

and 

/i t «„) = A(s). (250) 
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Finally, notice that 

(b s A a s d Sl u) — (251) 
h} (w s<u , c s A a s d s>u ) ( = (252) 
ft* (b s A Ws c s ) ( = } (253) 
where (i), (ii) and (Hi) follow respectively ( |243> , ( |243| l and pIT) . By ( |238] l and ( |249] >, we have 

/i^fcs I w s ,„c s , a s d s , u ) = (254) 
^(asds.u | & s ,w SiU c s ,) = 0. (255) 
Together with ( |25 1 | l-( |2"5"3"l ), we can prove that 

htfasds^) = = A(s) = h x (d SiU ). 

Thus, /^(dg | d s . u ) = and 

g{a s | in(«)) - h* (a s | in(tt), V* \ {</>}) = 0. 

Thus, <? € Cd(P) and the first claim is proved. 

We will now prove the second claim. The idea of the proof is similar to that in the incremental multicast scenario. 
Suppose (A, u) is O-achievable with respect to P. By definition, there exists a sequence of zero-error network codes 

{YfJeSuS} 

for P, and a sequence of positive constants c„ such that 

lim c n H(Y™) < lim c n H |SP(T e n )| < u(e) (256) 

n— too n—^oo 

lim c n H(Y s n ) = lim c n H\SP(Y?)\ > A(s). (257) 

n— >oo n— >oo 

Assume without loss of generality that SP(YJ') (i.e., the support of YJ 1 ) is equal to {0, . . . , |SP(Y^ n )| — 1}. For 
each n, construct the following set of random variables 

{U f Je 5*uf*UV*} 

such that for all e € £, s € 5 and it G -D(s), 

1) E7£ = E7£ u - F s "; 

2) [/ e n = F e "; 

3) t/^ 1 is uniformly distributed over SP(YJ, n ) for all s 6 5 and that 

if (E7£ , U%, Ul) = J] [/£ + (C/ Q " s , l£) ; 
fes 

4) = U£ = Y£+Y b n s mod |SP(F S ")|; 

5) UZHY5,ieS\s) 

6) [/" = (y«,s <= 5); 
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7) [7™ = 1 (i.e., U v = 1 is a deterministic random variable) for all v e V*; 
Again, it can be verified directly that 

{Uf,f e 5* U V*} 

is a strongly secure zero-error network codes for P+. Consequently, T(A, ui) is O-achievable with respect to P*, 
subject to strong secrecy constraint. 
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